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Preface to the Second Edition 


The idea of the second edition was originally motivated by improvement of certain 
notation within the chapters and correcting various typos suggested by the readers. 
However during this work I decided to add several interesting theorems that were 
missing in the first edition. For the convenience of the readers who are familiar with 
the first edition I would like to underline here the main changes that were made. My 
intention was not to overload the book with new topics but rather to improve the 
exposition of the existing ones. 


First of all in Section 1.3 we relate partial numerators and partial denominators 
to the classical notion of continuants. We supplement numerous formulae via 
expressions in terms of continuants further in the text. 

We have added a criterion of rational angles congruence in Subsection 2.1.8 and 
of integer triangle convergence (Proposition 6.7). 

In the new Section 2.5 and Section 18.6 we show the classification of integer- 
regular polygons and polyhedra respectively. 

We have included an explicit expression for LLS sequences of adjacent angles in 
terms of certain long continued fractions (see Section 5.5). 


Two algorithms to compute LLS sequences are added to Chapter 4 (see Sec- 
tion 4.5). 


Finally, the chapter on Gauss Reduction Theory (Chapter 7 of the first edition of 


the book) was a subject of the major metamorphose. It was substantially revised and 
split into several new chapters: 


Markov numbers are discussed in a separate Chapter 7 now. 

The section on geometry of continued fractions is substantially extended to new 
Chapter 8. In particular we have added a new technique of computation of LLS 
sequence periods for GL(2, Z) matrices. 

Chapter 9 on continuant representations of GL(2,Z) matrices is new. It is very 
much in the spirit of Gauss Reduction Theory. 
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¢ The semigroup of reduced matrices is discussed separately in Chapter 10. 


¢ The remaining material (of Chapter 7 of the first edition of the book) is now 
placed in Chapter 11: here we have added proofs for elliptic and parabolic matri- 
ces and revised the main case of the hyperbolic matrices. Additionally we have 
extended the exposition to the group GL(2, Z) (originally it was mostly regarding 
SL(2,Z)). 


Further examples and exercises were added to different chapters of the book. 


University of Liverpool Oleg Karpenkov 
February 2022 
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Preface to the First Edition 


Continued fractions appear in many different branches of mathematics: the the- 
ory of Diophantine approximations, algebraic number theory, coding theory, toric 
geometry, dynamical systems, ergodic theory, topology, etc. One of the metamathe- 
matical explanations of this phenomenon is based on an interesting structure of the 
set of real numbers endowed with two operations: addition a+ b and inversion 1/b. 
This structure appeared for the first time in the Euclidean algorithm, which was 
known several thousand years ago. Similarly to the structures of fields and rings 
(with operations of addition a+b and multiplication a b), structures with addition 
and inversion can be found in many branches of mathematics. That is the reason why 
continued fractions can be encountered far away from number theory. In particular, 
continued fractions have a geometric interpretation in terms of integer geometry, 
which we place as a cornerstone for this book. 


The main goal of the first part of the book is to explore geometric ideas behind 
regular continued fractions. On the one hand, we present geometrical interpretation 
of classical theorems, such as the Gauss—Kuzmin theorem on the distribution of el- 
ements of continued fractions, Lagrange’s theorem on the periodicity of continued 
fractions, and the algorithm of Gaussian reduction. On the other hand, we present 
some recent results related to toric geometry and the first steps of integer trigonom- 
etry of lattices. The first part is rather elementary and will be interesting for both 
students in mathematics and researchers. This part is a result of a series of lecture 
courses at the Graz University of Technology (Austria). The material is appropriate 
for master’s and doctoral students who already have basic knowledge of linear alge- 
bra, algebraic number theory, and measure theory. Several chapters demand certain 
experience in differential and algebraic geometry. Nevertheless, I believe that it is 
possible for strong bachelor’s students as well to understand this material. 


In the second part of the book we study an integer geometric generalization of 
continued fractions to the multidimensional case. Such a generalization was first 
considered by F. Klein in 1895. Later, this subject was almost completely abandoned 
due to the computational complexity of the structure involved in the calculation 
of the generalized continued fractions. The interest in Klein’s generalization was 
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revived by V.I. Arnold approximately one hundred years after its invention, when 
computers became strong enough to overcome the computational complexity. After 
a brief introduction to multidimensional integer geometry, we study essentially new 
questions for the multidimensional cases and questions arising as extensions of the 
classical ones (such as Lagrange’s theorem and Gauss—Kuzmin statistics). This 
part is an exposition of recent results in this area. We emphasize that the majority of 
examples and even certain statements of this part are on two-dimensional continued 
fractions. The situation in higher dimensions is more technical and less studied, 
and in many cases we formulate the corresponding problems and conjectures. The 
second part is intended mostly for researchers in the fields of algebraic number 
theory, Diophantine equations and approximations, and algebraic geometry. Several 
chapters of this part can be added to a course for master’s or doctoral students. 


Finally, I should mention many other interesting generalizations of continued 
fractions, coming from algorithmic, dynamical, and approximation properties of 
continued fractions. These generalizations are all distinct in higher dimensions. We 
briefly describe the most famous of them in Chapter 27. 


University of Liverpool Oleg Karpenkov 
February 2013 


Contents 


Part I Regular Continued Fractions 


1 


Classical Notions and Definitions...............0... 0.000000 cece 


1.1 


1.2 
1.3 
1.4 


1.5 
1.6 
1.7 


1.1.1 Definition of a continued fraction...................00.0. 
1.1.2 Regular continued fractions for rational numbers .......... 
1.1.3. Regular continued fractions and the Euclidean algorithm ... 
1.1.4 Continued fractions with arbitrary elements .............. 
Convergence of infinite regular continued fractions .............. 
Continuants! 356.50 sng eee ie yobees bgaron deed bee yes iw 
Existence and uniqueness of a regular continued fraction for a 

given real NUMDED: saxon oes eis Gk PAS eas 
Monotone behavior of convergents ............. 0.00.0 c eee ee eee 
Approximation rates of regular continued fractions .............. 
XCICISCS punishing eaten? Hee gece Ben ett ame eens 


On Integer Geometry .......... 00... cece eee eee ene nee 


2.1 


2.2 
2.3 
2.4 
2.5 
2.6 


Basic notions and definitions .................. 00. eee eee eee eee 
2.1.1 Objects and congruence relation of integer geometry ...... 
2.1.2 Invariants of integer geometry.............. 00. ee eee eee 
2.1.3 Index of sublattices ...... 0.2 eee ee 
2.1.4 Integer length of integer segments ....................4. 
2.1.5 Integer distance to integer lines.....................00.. 
2.1.6 Integer area of integer triangles.....................0... 
2.1.7. Index of rational angles ................ 0.00.0 e eee ee eee 
2.1.8 Congruence of rational angles.................0.....00.. 
Empty triangles: their integer and Euclidean areas ............... 
Integer area of polygons .......... 2... cece cece eee eee 
Pick’sformula’.....ces sang eee aia beatae he SA ee eS 
Integer-regular polygon ....... 0... cece cece eee eee eee 
The twelve-point theorem... ...... 0... cece eee eee eee 


xi 


Contents 
237 OEXGICIseS 4s hi esa oe be ass sea ea eee pee Gaels ces 35 
Geometry of Regular Continued Fractions ........................ 37 
3.1. Classical construction ......... 0.0 oe cee eee eee 37 
3.2 Geometric interpretation of the elements of continued fractions .... 41 
3.3 Index of an angle, duality of sails ...................0.0.00 000. 42 
BA. (EXELCISES «rig eee sash x et Ae eae ROE ees 43 
Complete Invariant of Integer Angles ..........................0. 45 
4.1 Integer sines of rational angles......................000 000000. 45 
4.2 Sails for arbitrary angles and their LLS sequences ............... 46 
4.3. Oncomplete invariants of angles with integer vertex ............. 48 
4.4 Equivalent tails of the angles sharing an edge ................... 50 
4.5 Two algorithms to compute the LLS sequence of an angle......... 31 
4.5.1 Brute force algorithm...................0... 0. eee ee eee 51 
4.5.2 Explicite formulae for LLS sequences via given 
coordinates of the angle................... 0... eee ee eee 53 
4:0° sEXCLCISES: ie. o55-f ak hecnele ba Ele wee Heed hobs Ca eb eae BoA bd 55 
Integer Trigonometry for Integer Angles..................-.-0000- 37 
5.1 Definition of trigonometric functions ...................00.000. 57 
5.2 Basic properties of integer trigonometry.....................00. 58 
5.3. Transpose integer angles............ 0... eee ee eee eee 59 
5.4 Adjacent integer angles.......... 0... eee eee eee 61 
5.5. LLS sequences for adjacent angles ....................000.000. 64 
5.6 Right integer angles: 22.6. .000 06. cis sesh ade besa eee sees 67 
5.7 Opposite interior angles .......... 0.0... eee eee eee 68 
3.8: (EXCLciSeS. orcs cb oot akgs nee te Aka Seta as Sa dad eke 68 
Integer Angles of Integer Triangles.................. 0. esses eee ees 69 
6.1 Integer sine formula ............ 0... eee ee eee 69 
6.2 On integer congruence criteria for triangles ..................... 70 
6.3 Onsums of angles in triangles ............... 0.0.0... eee ee eee 72 
6.4 Angles and segments of integer triangles ....................... 74 
6.5 Examples of integer triangles.................... 0.000002 eee, 75 
6:6: /EXELrcIseS) jc. bi send ie ced wets dow Sayeed aa daetewsae 78 
Quadratic forms and Markov Spectrum ...............-...0000 00 79 
7.1 Calculation of minima of quadratic forms ...................... 719 
7.2 Some properties of Markov spectrum ...................002000. 80 
7.3. Markov numbers ........... 0.00. eee eee 82 
7.4 Markov—Davenport characteristic ............... 00.0002 84 


P25: TE XE CISE Sisco eek coh eee aca ae pce ah ote eahea Lech oom peadag sab nena eam 86 


Contents xiii 


8 


10 


11 


Geometric Continued Fractions........................0..0000005 87 
8.1 Definition of a geometric continued fraction .................... 87 
8.2 Geometric continued fractions of hyperbolic GL(2,R) matrices.... 88 
8:3" Duality of sails:.3)s0 vag dpa gies a enh see data gt adeweee tebe as 88 
8.4 LLS sequences for hyperbolic matrices ....................00.. 90 
8.5 Algebraic sails and their LLS cycles......................000.. 90 
8.5.1. Algebraicisails: 2 ii. oe yee eds be Seater RS 90 
8.5.2 LLS periods and LLS cycles of GL(2,Z) matrices......... 90 
8.6 Computing LLS cycles of GL(2,Z) matrices.................4.. 93 
8.6.1 Differences of sequences ..................00 00. 93 
8.6.2 LLS cycles for SL(2,Z) matrices with positive eigenvalues . 93 
8.6.3. LLS cycles for GL(2,Z) matrices .............0. 2.0000 95 
82:7) (EXercisesS. 2. .h seid Reeder ptd eee hen deeddcedd Mawes 96 
Continuant Representation of GL(2,Z) Matrices................... 97 
9.1 Generators of SL(2,Z) and the modular group .................. 97 
9.2 Basic properties of matrices Mg, ay ee eee cee ene 98 
9.3 Matrices of GL(2,Z) in terms of continuants.................... 100 
9.4 Anexpression of matrices in terms of Ms and My................ 105 
OD 'EXELciSeS, fates eet ishae een ee hee Seabee wit ea nace ldo 106 
Semigroup of Reduced Matrices .........................0000000- 107 
10.1 Definition and basic properties of reduced matrices .............. 107 
10.1.1 Reduced matrices ........ 0... ee eee eee 107 
10.1.2 Continuant representations of reduced matrices ........... 108 
10.1.3 A necessary and sufficient condition for a matrix to be 
TECUCEd: é..heh octane hae ta aed esdd el noe tenes 110 
10.1.4 LLS cycles of reduced matrices ....................00.. 112 
10.2 Existence of reduced matrices in every integer conjugacy class of 
GE(OED), Baek occ naacarts Chet ceca atau eeaae ae nate adh eed ines 113 
103 EX€LciseS: \..cs cya dah ototees eheben ora pesied ender dialedd Mates ie 114 
Elements of Gauss Reduction Theory..........................0-. 115 
11.1 Conjugacy classes of GL(2,Z) in general .................000-. 115 
LL.D EM pte: Case: joes es coat tid ls Sa ais Boe Eatin d etal eo hess 116 
11.3. Parabolic Case: o..4.5 66 Gratton oie ged dene bedside ciaeesds 118 
114 “Hyperbolieicase: oc. ys6 0 os hha a RAS eae aS Maes 121 
11.4.1 The set of reduced matrices integer conjugate to a given one 121 
11.4.2 Complete invariant of integer conjugacy classes........... 122 
11.4.3 Algebraicity of matrices with periodic LLS sequences ..... 122 
11.5 Computation of all reduced matrices integer conjugate to a given one 123 
11.5.1 Explicit computation via LLS cycles .................... 123 
11.5.2 Algorithmic computation: Gauss Reduction theory ........ 124 
11.6 Statistical properties of reduced SL(2,Z) matrices ............... 125 


11.6.1 Complexity of reduced matrices ....................00.. 125 


12 


13 


14 


Contents 
11.6.2 Frequencies of reduced matrices....................00.. 126 
Tid 7 “EXERCISES scp fe tog ees WAS EH eh ee ee Te LS area ee wie bee 127 
Lagrange’s Theorem ............... 0. cece cece eee eect e een enneee 129 
12.1 The Dirichlet group......... 0.0... eee eee eee 129 
12.2 Construction of the integer nth root of aGL(2,Z) matrix ......... 132 
12.3": Pell’ s:€quation ss. .iaa Benes Vinee nee te Nee eo ews es 133 
12.4 Periodic continued fractions and quadratic irrationalities.......... 136 
12:3: BX€fciseS i004 36.444-5e Rees ela abe teenie Ue eed ea ed eed ew 138 
Gauss—Kuzmin Statistics. ............0... 00.0. 139 
13.1 Some information from ergodic theory ....................0000- 140 
13.2 The measure space related to continued fractions ................ 141 
13.2.1 Definition of the measure space related to continued fractions 141 
13.2.2 Theorems on density points of measurable subsets ........ 141 
13.3 On the Gauss map ....... 0... eee eens 142 
13.3.1 The Gauss map and corresponding invariant measure ...... 142 
13.3.2 An example of an invariant set for the Gauss map ......... 144 
13.3.3 Ergodicity of the Gauss map....................000000- 145 
13.4 Pointwise Gauss—Kuzmin theorem ............ 0.0. e eee eee ee 147 
13.5 Original Gauss—Kuzmin theorem ....................0020000. 148 
13.6 Cross-ratio in projective geometry ............... 0... ee eee eee 148 
13.6.1 Projective linear group......... 02... eee ee eee eee 149 
13.6.2 Cross-ratio, infinitesimal cross-ratio..............0 00 cee 149 
13.7 Smooth manifold of geometric continued fractions............... 150 
13.8 Mobius measure on the manifolds of continued fractions.......... 150 
13.9 Explicit formulas for the Mébius form .....................000. 151 
13.10Relative frequencies of edges of one-dimensional continued 
TPACH ONS, 420.5563 OSE Nicos esis Maeda Ge bane Oa he bie bean 152 
13: DLE XEGCISES isin’ silos ac sepeheia eines aes, Oa eeehs Whee Graton. ee ee aA woud 154 
Geometric Aspects of Approximation. ...........................0. 155 
14.1 Two types of best approximations of rational numbers............ 155 
14.1.1 Best Diophantine approximations...................200- 155 
14.1.2 Strong best Diophantine approximations ................. 160 
14.2 Rational approximations of arrangements of two lines ............ 162 
14.2.1 Regular angles and related Markov—Davenport forms ..... 163 
14.2.2 Integer arrangements and their sizes ................000- 164 
14.2.3. Discrepancy functional and approximation model ......... 164 
14.2.4 Lagrange estimates for the case of continued fractions 
with bounded elements ............. 2... eee eee eee eee 165 
14.2.5 Periodic sails and best approximations in the algebraic case . 169 
14.2.6 Finding best approximations of line arrangements ......... 171 


1°43) (EXELCISES: a-s99 002 co sdeco ties ah bes bee Benoa $e da ta ak tables aa bwid beers 174 


Contents Xv 


15 Geometry of Continued Fractions with Real Elements and Kepler’s 


Second Law? sesh ook ketene cain oe hee nse Seed ete ae 177 
15.1 Continued fractions with integer coefficients .................... 177 
15.2 Continued fractions with real coefficients ...................... 179 
15.2.1 Broken lines related to sequences of arbitrary real numbers . 180 
15.2.2 Continued fractions related to broken lines ............... 182 
15.2.3. Geometry of continued fractions for broken lines.......... 182 
15.2.4 Proof of Theorem 4.16............... 0... eee eee eee 186 

15.3 Areal and angular densities for differentiable curves ............. 188 
15.3.1 Notions of real and angular densities .................... 188 
15.3.2 Curves and broken lines................ 00.0.0. e eee eee 190 
15.3:3° Some examples 'siiie sh ceod 0s awk veh yew dest ewr slim gsias 191 

1554 EXGrciséS) 5, sos O86 skates heen vere ta daleed. cnoidal heh diye des 193 
16 Extended Integer Angles and Their Summation ................... 195 
16.1 Extension of integer angles. Notion of sums of integer angles...... 195 
16.1.1 Extended integer angles and revolution number ........... 196 
16.1.2 On normal forms of extended angles .................... 199 
16.1.3 Trigonometry of extended angles. Associated integer angles 203 
16.1.4 Opposite extended angles ......................000000. 203 
16.1.5 Sums of extended angles ..................... 00002 204 
16.1.6 Sums of integer angles...............0.... 0.00. e eee eee 205 

16.2 Relations between extended and integer angles .................. 205 
16.3 Proof of Theorem 6.9(1) ......... 0... cc cc cece eee eens 206 
16.3.1 Two preliminary lemmas.................... 0000000005 207 
16.3.2 Conclusion of the proof of Theorem 6.9(i) ............... 211 

164 (Exercises: 2.6 5:c5cc6 bot ei tesstbis (ge eee EE Ce a eee eae 213 

17 Integer Angles of Polygons and Global Relations for Toric 

ROTA TL 1 P21 18 (a egg ee eee 215 
17.1 Theorem on angles of integer convex polygons .................. 215 
17.2 Toric surfaces and their singularities ..................00 000 ee 217 
17.2.1 Definition of toric surfaces ............. 00.0.0 eee ee eee 217 
17.2.2 Singularities of toric surfaces ..............0...0000 008) 218 

17.3 Relations on toric singularities of surfaces ...................0.. 220 
17.3.1 Toric singularities of n-gons with fixed parametern........ 220 
17.3.2 Realizability of a prescribed set of toric singularities....... 221 

17:4 EX€LCISES. biciehet wie Seeds Oh nee ON Soin Eee 223 


Part II Multidimensional Continued Fractions 


18 Basic Notions and Definitions of Multidimensional Integer Geometry 229 
18.1 Basic integer invariants in integer geometry..................... 229 
18.1.1 Objects and the congruence relation..................... 229 

18.1.2 Integer invariants and indices of sublattices .............. 230 


19 


20 


Contents 
18.1.3 Integer volume of simplices ....................0.0000. 231 
18.1.4 Integer angle between two planes...................0... 231 
18.1.5 Integer distance between two disjoint planes.............. 232 
18.2 Integer and Euclidean volumes of basis simplices................ 233 
18.3 Integer volumes of polyhedra................ 0.0.0... e eee eee 235 
18.3.1 Interpretation of integer volumes of simplices via 

Euclidean volumes ............ 0.00. 0c cece eee eee eee 236 
18.3.2 Integer volume of polyhedra .....................0000.. 236 
18.3.3. Decomposition into empty simplices .................... 237 

18.4 Lattice Pliicker coordinates and calculation of integer volumes of 
SUMP] COS” aca nh whiten des Cen NOMEN SME eae RAED SSeS 238 
18.4.1 Grassmann algebra on R” and k-forms .................. 238 
18.4.2 Pliicker coordinates ......... 0... eee eee 239 
18.4.3 Oriented lattices in R” and their lattice Plticker embedding . 240 
18.4.4 Lattice Pliicker coordinates and integer volumes of simplices 241 
18.5 Ehrhart polynomials as generalized Pick’s formula............... 242 
18.6 Integer-regular polyhedra ........... 0... eee eee eee 244 
18.6.1 Definition of integer-regular polyhedra .................. 244 
18.6.2 Schlaflisymbols ............ 0... eee eee 245 
18.6.3 Euclidean regular polyhedra .......................00.. 246 
18.6.4 Preliminary integer notation .....................00000. 247 
18.6.5 Integer-regular polyhedra in arbitrary dimensions ......... 249 
18.7 EXGIcises. #205. chat chee bee deenc ee isda eth ee eames 249 
On Empty Simplices, Pyramids, Parallelepipeds ................... 253 
19.1 Classification of empty integer tetrahedra ...................... 253 
19.2 Classification of completely empty lattice pyramids.............. 254 
19.3. Two open problems related to the notion of emptiness ............ 256 
19.3.1 Problem on empty simplices .......................00.. 256 
19.3.2 Lonely runner conjecture.................. 020 eee ee eee 257 

19.4 Proof of White’s theorem and the empty tetrahedra classification 
TheOreMS sso. cleat EG Rea dk chee ne eAGalean sa 258 
19:4.) ID@-system «sce os cece bee ee Ao oe dee ee ae 258 
19.4.2 A lemma on sections of an integer parallelepiped ......... 259 

19.4.3 A corollary on integer distances between the vertices and 
the opposite faces of a tetrahedron with empty faces ....... 259 
19.4.4 Lemma on one integer node ....................000000. 260 
19.4.5 Proof of White’s theorem ................. 0.0.00. e eee 261 
19.4.6 Deduction of Corollary 19.3 from White’s theorem........ 263 
19:5) EX@LCISeS: cei h che ed hohe dave e orto dase cA bodes begat 264 
Multidimensional Continued Fractions in the Sense of Klein ........ 265 
20:1. Background: s.s23 atte ioe ree bated eee 265 
20.2 Some notation and definitions ................... 0... eee 266 


20.2.1 A-hulls and their boundaries .............. 0.00.0 e ce eee 266 


Contents Xvii 


21 


22 


20.2.2 Definition of multidimensional continued fraction in the 


sense:0f Klein o 2.8 ce0 cen chow Gee erin hie Cie ROE EE Ss 267 
20.2.3 Face structure of sails..... 0... eee eee eee 267 
20.3 Finite continued fractions ......... 0.0.0.0. cee eee eee eee 268 
20.4 Ona generalized Kronecker’s approximation theorem ............ 269 
20.4.1 Addition of sets in R? 2.0... cece eee eee eee 269 
20.4.2 Integer approximation spaces and affine irrational vectors .. 270 
20.4.3 Formulation of the theorem.....................000000. 271 
20.4.4 Proof of the Multidimensional Kronecker’s approximation 
fi 110) 2c) 16 UR ee 271 
20.5 Polyhedral structure of sails ......... 0.00... eee eee eee 274 
20.5.1 The intersection of the closures of A-hulls with faces of 
corresponding ConeS......... 0.0... eee eee ee eee 275 
20.5.2 Homeomorphic types of sails .................-.000000- 277 
20.5.3, Combinatorial structure of sails for cones in general position 278 
20.5.4 A-hulls and quasipolyhedra.....................000000. 282 
20.6 Two-dimensional faces of sails ......... 00.0... eee eee eee eee 282 
20.6.1 Faces with integer distance to the origin equal one......... 282 
20.6.2 Faces with integer distance to the origin greater than one . . . 284 
20.7 | EXELCISES: Kaki ree etd hast Mea whee yes each eee 284 
Dirichlet Groups and Lattice Reduction ....................-0005- 287 
21.1 Orders, units, and Dirichlet’s Unit Theorem..................... 287 
21.2 Dirichlet groups and groups of units in orders ................... 288 
21.2.1 Notion of a Dirichlet group ......................00000. 288 
21.2.2 On isomorphisms of Dirichlet groups and certain groups 
OLUNIUS id doe vets Let eee ond ear eee sttad 288 
21.2.3 Dirichlet groups related to orders that do not have complex 
FOOtS-OF UNITY eco ocean wee anions La wd ea 290 
21.3 Calculation of a basis of the additive group '(A) ................ 291 
21.3.1 Step 1: preliminary statements .....................00.. 291 
21.3.2 Step 2: application of the LLL-algorithm ................ 292 
21.3.3 Step 3: calculation of an integer basis having a basis of an 
integer sublattice... 2... eee ee eee 292 
21.4 Calculation of a basis of the positive Dirichlet group 2,(A)....... 293 
21.5 Lattice reduction and the LLL-algorithm ....................... 293 
J2V3A Reduced: bases isiivey ek aah yeaheah ae i 294 
21.5.2 The LLL-algorithm ................. 0. cee eee eee 295 
21 GSEXELCISES: «22s pei nieet Abe Fpls eet s eels oe da be deere seehe ie 296 
Periodicity of Klein polyhedra. Generalization of Lagrange’s 
Whe Oren... o..cc.0eGes ek Ser Sones eee yen oe ant Sarees wl 297 
22.1 Continued fractions associated to matrices...................00- 297 
22.2 Algebraic periodic multidimensional continued fractions.......... 298 


22.3 Torus decompositions of periodic sails in R?.................00. 299 


XVili 


23 


24 


Contents 
22.4 Three single examples of torus decompositions in R? ............ 300 
22.5 Examples of infinite series of torus decomposition ............... 304 
22.6 Two-dimensional continued fractions associated to transpose 
Frobenius normal forms ............ 00... e cece eee eee eee 307 
22.7 Some problems and conjectures on periodic geometry of algebraic 
SaUlSrii, cee ead ons hi oh EAN oes cade aa hiey Bio A 309 
22.8 Generalized Lagrange’s Theorem .....................000 0000. 311 
22.9 Littlewood and Oppenheim conjectures in the framework of 
multidimensional continued fractions ......................000- 314 
22 LOEXELCISES. 000 25 05 oa SAS LA Sa aS «Baayen tan eile ad 315 
Multidimensional Gauss—Kuzmin Statistics ...................... 319 
23.1 Mobius measure on the manifold of continued fractions .......... 319 
23.1.1 Smooth manifold of n-dimensional continued fractions... .. 319 
23.1.2 Mobius measure on the manifolds of continued fractions ... 320 
23.2 Explicit formulae for the MObius form...................0..00. 321 
23.3 Relative frequencies of faces of multidimensional continued 
fractiONS: «2/5. cckdd eae cae e ah Shae et odes doe d Saeed 323 
23.4 Some calculations of frequencies for faces in the two-dimensional 
CAS Cree eee tiaras ewan mai ah ean yatean ite Gite hb may ete ew elena aaa at 324 
23.4.1 Some hints for computation of approximate values of 
relative frequencies ...... 0.0... . eee eee eee ee 324 
23.4.2 Numeric calculations of relative frequencies.............. 325 
23.4.3 Two particular results on relative frequencies ............. 327 
235. EXELCISES ie sas ty tee ie ee a isla eh oe Oeste at eS 328 
On the Construction of Multidimensional Continued Fractions...... 329 
24.1 Inductive algorithm ......... 0.0.0... eee eee 329 
24.1.1 Some background........... 00.0... eee eee eee 329 
24.1.2 Description of the algorithm .......................00.. 330 
24.1.3 Step la: construction of the first hyperface ............... 331 
24.1.4 Step 1b, 4: how decompose the polytope into its faces ..... 332 
24.1.5 Step 2: construction of the adjacent hyperface ............ 332 
24.1.6 Step 2: test of the equivalence class for the hyperface F’ to 
have representatives in the set of hyperfaces D............ 333 
24.2 Deductive algorithms to construct sails..................000000. 334 
24.2.1 General idea of deductive algorithms .................... 334 
24.2.2 The first deductive algorithm ......................00.. 334 
24.2.3 The second deductive algorithm ....................0... 335 
24.2.4 Test of the conjectures produced in the two-dimensional case338 
24.2.5 On the verification of a conjecture for the multidimensional 
CASE Medecine tenes oP ecto iodeny sate Meceeitaty eee te Ciera 343 
24.3 An example of the calculation of a fundamental domain .......... 344 


DAMA SERELCISES” Sree 84.5 Poe Ske Fen ew te ahs OS ad Wn eae tb Garden eae es 348 


Contents XIX 
25 Gauss Reduction in Higher Dimensions ........................... 349 
25.1 Organization of this chapter ............ 0.0.0... e eee eee eee 349 
25.2 Hessenberg matrices and conjugacy classes..................0.. 350 
25.2.1 Notions and definitions ......... 0.0... cee eee eee eee eee 351 
25.2.2 Construction of perfect Hessenberg matrices conjugate to 
A SIVEMONE! 2 cache yids MeN i ht A ea eae 353 
25.2.3 Existence and finiteness of ¢-reduced Hessenberg matrices . 355 
25.2.4 Families of Hessenberg matrices with given Hessenberg type 356 
25.2.5 ¢-reduced matrices in the 2-dimensional case............. 358 
25.3 Complete geometric invariant of conjugacy classes .............. 359 
25.3.1 Continued fractions in the sense of Klein—Voronoi ....... 360 
25.3.2 Geometric complete invariants of Dirichlet groups ........ 363 
25.3.3, Geometric invariants of conjugacy classes ............... 365 
25.4 Algorithmic aspects of reduction to ¢-reduced matrices........... 366 
25.4.1 Markov—Davenport characteristics .................00.. 366 
25.4.2 Klein—Voronoi continued fractions and minima of 
MD-characteristics ..... 0... eee eee eee eee 369 
25.4.3 Construction of ¢-reduced matrices by Klein—Voronoi 
continued fractions .......... 0. cece cece eee eee eee 370 
25.5 Diophantine equations related to the Markov—Davenport 
ChALAClENSUC) h.5 cS aeandi ans Gano aes han adies bop wees 372 
25.5.1 Multidimensional w-sails and w-continued fractions ....... 372 
25.5.2 Solution of Equation 25.1 ......... 0.0 eee eee eee 374 
25.6 On reduced matrices in SL(3,Z) with two complex conjugate 
eigenvalues satin thoee aie wee tae at 374 
25.6.1 Perfect Hessenberg matrices of a given Hessenberg type... . 375 
25.6.2 Parabolic structure of the set of NRS-matrices ............ 375 
25.6.3 Theorem on asymptotic uniqueness of ¢-reduced 
NRS- Matrices nat ieee dak Soe ee a ee eee eee 377 
25.6.4 Examples of NRS-matrices for a given Hessenberg type... . 379 
25.6.5 Proof of Theorem 25.43........ 0.0.0.0. eee eee eee 381 
25.6.6 Proof of Theorem 25.48 ........ 0.0.0... cece eee eee 384 
25.7 Open problems w-.s..4 oe xtiei gee ak ee tees eee eee Boe peed oes Pe wh 390 
29.8 EXELCISES isc bie Geiss ored hci hee 8 Sh bial eee Single area lade ees 392 
26 Approximation of Maximal Commutative Subgroups .............. 395 
26.1 Rational approximations of MCRS-groups ...................-- 395 
26.1.1 Maximal commutative subgroups and corresponding 
simplicial cones ......... 0. cece eee eee eee eens 396 
26.1.2 Regular subgroups and Markov—Davenport forms ........ 397 
26.1.3 Rational subgroups and their sizes ...............002-005 398 
26.1.4 Discrepancy functional ......... 0.0... cee eee eee eee 399 
26.1.5 Approximation model .......... 20.0... cee eee eee eee ee 400 
26.1.6 Diophantine approximation and MCRS-group approximation400 
26.2 Simultaneous approximation in R* and MCRS-group approximation 401 


XX Contents 
26.2.1 General construction ........ 20.0... c cece ees 401 
26.2.2 A ray of anonreal spectrum operator ...............2--45 402 
26.2.3 Two-dimensional golden ratio. ........ 0... eee eee eee 403 
26:3: JEXELCISES? s.cnle ee fes.cte thst bie S tenleh stln ota ela et oade dea eyutee 404 
27 Other Generalizations of Continued Fractions..................... 405 
27.1 Relative minima......... 0.2.0... eee eee ee eee 405 
27.1.1 Relative minima and the Minkowski—Voronoi complex... . 406 
27.1.2 Minkowski—Voronoi tessellations of the plane ........... 408 
27.1.3 Minkowski—Voronoi continued fractions in R?........... 409 
27.1.4 Combinatorial properties of the Minkowski—Voronoi 

tessellation for integer sublattices....................0.. 410 
27.2 Farey addition, Farey tessellation, triangle sequences............. 412 
27.2.1 Farey addition of rational numbers.................2.-4. 412 
27.2.2. Farey tessellation 2.3... ccs beets Sak SASS SO A Sas 413 
27.2.3 Descent toward the absolute .....................0000.. 414 
27.2.4 Triangle sequences ............ 0. cece eee 416 

27.3, Decompositions of coordinate rectangular bricks and O’ Hara’s 
AlSOrith Ms .s5. ses khs cake stoke Aas bs hag ghe ead deals es 420 
27.3.1 II-congruence of coordinate rectangular bricks ........... 421 
27.3.2 Criteron of I[-congruence of coordinate bricks ........... 422 
27.3.3, Geometric version of O’Hara’s algorithm for partitions .... 423 
27.4 Algorithmic generalized continued fractions .................... 425 
27.4.1 General algorithmic scheme ......................0000. 425 
27.4.2 Examples of algorithms ........................000 000. 426 
27.4.3 Algebraic periodicity .......... 0... eee eee eee 428 
27.4.4 A few words about convergents ....................00.. 428 
27.5 Branching continued fractions .............. 0.0... e cece eee eee 429 
27.6 Continued fractions and rational knots and links ................. 433 
27.6.1 Necessary definitions .......... 0.0.0... eee eee 433 
27.6.2 Rational tangles and operations on them ................. 434 
27.6.3 Main results on rational knots and tangles................ 435 
References.) yes he ea ee Eh ey Be eg 437 
INDEX! obs eed bed aieeid eid, cae hha en ead ee Air hae ett teed aedds 447 


Part I 
Regular Continued Fractions 


In the first part of this book we study geometry of continued fractions in the plane. 
As usually happens for low dimensions, this case is the richest, and many interest- 
ing results from different areas of mathematics show up here. While trying to prove 
analogous multidimensional statements, several research groups have invented com- 
pletely different multidimensional analogues of continued fractions (see, e.g., Chap- 
ter 27). This is due to the fact that many relations and regularities that are similar in 
the planar case become completely different in higher dimensions. So we start with 
the classical planar case. 


The first two chapters are preliminary. Nevertheless, we strongly recommend that 
the reader look through them to get the necessary notation. We start in Chapter 1 
with a brief introduction of the notions and definitions of the classical theory of 
continued fractions. Further, in Chapter 2, we shed light on the geometry of the 
planar integer lattice. We present this geometry from the classical point of view, 
where one has a set of objects and a group of congruences acting on the set of 
objects. This approach leads to natural definitions of various integer invariants, such 
as integer lengths and integer areas. 


In Chapter 3 we present the main geometric construction of continued fractions. 
It is based on the notion of sails, which are attributes of integer angles in integer 
geometry. 


In the next three chapters we come to the problem of the description of integer 
angles in terms of integer parameters of their sails. We start in Chapter 4 with a 
construction of a complete invariant of integer angles (it is written in terms of inte- 
ger characteristics of the corresponding sails). Further, in Chapter 5, we use these 
invariants to define trigonometric functions for integer angles in integer geometry. 
These functions have many nice properties similar to Euclidean trigonometric func- 
tions, while the others are totally different. For instance, there is an integer analogue 
of the formula «@ + 8 + y = a for the angles in a triangle in the Euclidean plane, 
which we prove in Chapter 6. 


Further in Chapter 7 we briefly discuss discrete Markov spectrum and introduce 
the notion of Markov—Davenport characteristics for matrices. 


In Chapter 8 we introduce a notion of geometric continued fractions related to 
arrangements of pairs of lines passing through the origin in the plane. One can con- 
sider these arrangements to be invariant lines of certain real two-dimensional matri- 
ces. So geometric continued fractions give rise to powerful invariants of conjugacy 
classes of matrices. They are especially useful for the study of GL(2,Z) matrices 
when the geometric continued fractions have a rich structure of automorphisms. We 
discuss both theoretical and computational aspects in the chapter. 


We slightly touch the subject of the structure of the groups SL(2, Z) and GL(2, Z) 
in Chapter 9. First we discuss the classic representation, when the group SL(2,Z) 
is generated by two elements. Further we introduce a continuant description of 
GL(2,Z) matrices and show how to use it in order to write the elements of SL(2,Z) 
in terms of these two generators. 


Further in Chapter 10 we introduce a semigroup of reduced matrices and study 
its basic properties. In this chapter we collect some preliminary material for Gauss 
Reduction Theory discussed in the next chapter. 


Chapter 11 is dedicated to the study of integer conjugacy classes of GL(2, Z) ma- 
trices. Here we separate the following three distinct cases: elliptic (complex spectra) 
matrices; parabolic (real rational spectra) matrices; and hyperbolic (real irrational 
spectra) matrices. The last case is a little more advanced. It gives rise to a famous 
Gauss Reduction Theory, which is could be seen as a certain Euclidean algorithm 
on the space of GL(2,Z) matrices. We discuss the geometric approach to the Gauss 
Reduction Algorithm. We conclude this chapter with several statistical etudes. 


In Chapter 12 we focus on the classical question related to Lagrange’s theorem on 
the periodicity of continued fractions for quadratic irrationalities, which is based on 
the structure of the set of solutions of Pell’s equations. In this chapter we also discuss 
a few questions related to Dirichlet groups of matrices, in particular a problem of 
solving the equation X" = A in GL(n,Z). 


We continue in Chapter 13 with a classical question of Gauss—Kuzmin statistics 
for the distribution of elements of continued fractions for arbitrary integers. After 
the introduction of the classical approach via ergodic theory, we present a geomet- 
ric meaning of the Gauss—Kuzmin statistics in terms of cross ratios in projective 
geometry. 


Regular continued fractions are very good approximations to real numbers. Since 
there is a vast number of publications on this subject, we do not attempt to cover all 
interesting problems on continued fractions related to approximations. In Chapter 14 
we discuss two questions related to best approximations of real numbers and of 
arrangements of lines in the plane. 


In the last three chapters of the first part we introduce applications of contin- 
ued fractions to differential and algebraic geometry. In Chapter 15 we introduce an 
“infinitesimal” version of continued fractions arising as sails of planar curves. We 
show the relation of these fractions to the motion of a body along this curve satisfy- 
ing Kepler’s second law. Further, in Chapter 16, we give a supplementary definition 
of extended integer angles, which we use to prove the formula on the sum of inte- 
ger angles in a triangle. In addition, we introduce a definition of the sum of integer 
angles, which is a good candidate for the correct addition operation on continued 
fractions. Finally, in Chapter 17, we describe the global relations on singular points 
of toric surfaces via integer tangents of the integer angles of polygons associated to 
these surfaces. 


Chapter 1 
Classical Notions and Definitions 


| ® 
Check for 
updates 


In this chapter we bring together some basic definitions and facts on continued frac- 
tions. After a small introduction we prove a convergence theorem for infinite regular 
continued fractions. Further, we prove existence and uniqueness of continued frac- 
tions for a given number (odd and even continued fractions in the rational case). 
Finally, we discuss approximation properties of continued fractions. For more de- 
tails on the classical theory of continued fractions we refer the reader to [116]. 


1.1 Continued fractions 


1.1.1 Definition of a continued fraction 


We start with basic definitions in the theory of regular continued fractions. 


Definition 1.1. Let @, ao,...,a, be real numbers satisfying 


a@=agt+ 


a, + ———_._— 


: 1 
an 


(1.1) 


The expression in the right-hand side of the equality is called a continued fraction 
expansion of a given number & (or just a continued fraction of a), and denoted by 
[a0;a1 : ++: d,|. The numbers ao,...,dy are the elements of this continued fraction. 


Definition 1.2. A continued fraction is odd (even) if it has an odd (even) number of 


elements. 
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Definition 1.3. An infinite continued fraction with an infinite sequence of elements 
ao,a1,... 1s the following limit if it exists 


lim [ao3a1 +++: a]. 
k-s00 
We denote it by [ag;a, :---]. 
Definition 1.4. The number [ao;a; : --- : ag_1] is called the k-convergent (or just 


convergent) to the finite or infinite continued fraction [ag;a : ---]. 


Definition 1.5. When the first element ao of a continued fraction is an integer and 
all other elements are positive integers, we say that this continued fraction is regular. 


1.1.2 Regular continued fractions for rational numbers 


Let us show how to construct a regular continued fraction for a rational number @. 
Algorithm to construct a regular continued fraction. 


Input data. Consider a rational number a. 


Goal of the algorithm. We are constructing a continued fraction for @. At each step 
we define a pair of integers (ay_1,rg). 


Step 1. If @ is an integer, then a = [a], and the algorithm terminates. Suppose now 
that o is not an integer. Let us subtract the integer part | @ | and invert the remainder, 
1.€., 
a =|a|+—— 
1/(a—|a]) 
Write aj = |a@ |, and continue with the remaining part r; = 1/(@— ag). 
Inductive Step k. Suppose we have completed k — 1 steps and get the numbers az_ 
and r;,_;. Let us find a,_, and rz: 
1 


1a lel): 


Then we set ay_) = |[rg_—1| and rg = 1/(r~_-1 — ag_1). 


ret = [re-a| + 


Output. The algorithm for noninteger numbers stops at Step n when r, is an integer. 
We get 

O = [ao3a1 2 +++ 2 dn—1 2 An, 
where dy = rp. 


Remark 1.6. Let r; = p;/q; with positive relatively prime integers p;, g; for i > 1. 
Then for every k > 1 we have 
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Pk+1 1 1 qk 


Ik-+1 tea [rel — [e/a] ~ Pe—4eLPK/ ae) 


Since 7; > 1, its denominator is less than its numerator (i.e., 9x11 < Pe+1 = Wk)- 
Hence the sequence of denominators g, decreases with the growth of k. Therefore, 
the algorithm stops in a finite number of steps. 


Example 1.7. Let us study one example: 


So we get the continued fraction [1;3 : 2]. 


1.1.3 Regular continued fractions and the Euclidean algorithm 


The term continued fraction was used for the first time by J. Wallis in 1695, but the 
story of continued fractions starts much earlier with the Euclidean algorithm, named 
after the Greek mathematician Euclid, who described it in his Elements (Books 
VII and X). Actually the algorithm was known before Euclid; it was mentioned 
in the Topics of Aristotle. For further historical details we refer to the book [63] by 
D.H. Fowler. The goal of the algorithm is to find the greatest common divisor of a 
pair of integers. Let us briefly describe the algorithm. 


Euclidean algorithm. 


Input data. Consider two positive integers p and q. 


Goal of the algorithm. To find the greatest common divisor of p and q (usually it 
is denoted by gcd(p,q)). We do this in several iterative steps. 


Step 1. Set go = q. Find integers ag and q; with g > r; > 0 such that 

P=409+ 41. 
Inductive Step k. Suppose that we have completed k — 1 steps and get the integers 
ay—2 and q,_1. Find integers a,_, and gx, where gx_1 > gx = 0 such that 


Dk-2 = Ak-19k-1 + Yk. 


Output. The algorithm stops at Step n when r, = 0. It is clear that 


ged(p,q) = ged(q,qi) = ged(qi,q2) = ++: = gced(qn—2,9n—1) = Qn-1- 
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Remark 1.8. Since the sequence (q,) is a decreasing sequence of positive integers, 
the algorithm always stops. 


Remark 1.9. The integers a,_; and rz at each step are uniquely defined. The Eu- 
clidean algorithm actually generates the elements of a continued fraction. We always 
have 

ean Pe eee 

— = [ag; ay 3 +++ 2 ay). 


1.1.4 Continued fractions with arbitrary elements 


In general one can consider continued fractions with arbitrary real elements. In 
Chapter 6 and in Chapter 17 we use continued fractions with arbitrary integer el- 
ements to describe integer triangles and later to generalize to the case of toric va- 
rieties. In Chapter 15 we present some of the geometric ideas behind continued 
fractions with arbitrary integer and real coefficients. In the rest of the book we shall 
deal with regular continued fractions. 

In the case of continued fractions with arbitrary elements, we could encounter 
infinity while calculating the value of a continued fraction. For instance, this is the 
case for the continued fraction [1;1 : 0]. To avoid the annoying consideration of 
different cases of infinities, we propose to add an element - to the field of real 
numbers R and define the following operations: 


co +d =a+o = oo, Pec —=0. 


Denote the resulting set by R. Notice that we do not add two elements: +o and —co, 
but only one: co. In some sense we are considering the projectivization of the real 
line. 

In this notation, for the continued fraction [1 : 1;0] we have 


1 1 1 
~=14 14 1+0=1. 
iG 1+ 00 oo 


[1;1:0] =14 


1.2 Convergence of infinite regular continued fractions 


In this section we show that a sequence of k-convergents of any infinite regular 
continued fraction converges to some real number. First, we give a formula for nu- 
merators and denominators of rational numbers in terms of elements of continued 
fractions of these numbers. Second, we introduce partial numerators and denomi- 
nators and prove some relations between them. Finally, we formulate and prove a 
theorem on convergence of infinite continued fractions. 
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Let us say a few words about the numerators and denominators of rational num- 
bers and the elements of their continued fractions. It turns out that the numerator 
and the denominator can be expressed as polynomial functions of the elements of 
the continued fraction. Namely, there exists a unique pair of polynomials P, and OQ; 
in the variables xo,...,x, with nonnegative integer coefficients such that 


Py(X0,--- Xk) 


Ox (x0, --« Xk) 


The first condition defines the polynomials only up to a multiplicative constant, so 
the second condition is a necessary normalization condition. 


=[xosx1:-++ tay], and P,(0,...,0)+Q;,(0,...,0) =1. 


Example 1.10. Let us calculate the above polynomials for the continued fractions 


[xo], [x03%1], [0341 : X2],-.... The polynomials are as follows: 
Po(x0) = xo, Qo(xo) = 1; 
Pi (x0,%1) = x0x1 +1, Q1(X0,%1) = 41; 


P2(x0,%1,%2) = Xox1xX2 +.x2 +.x0, O2(x0,%1,%2) = x1x2 +15. 


Now the numerator and the denominator are described via polynomials P and 
Q as follows. Consider a finite (or infinite) continued fraction [a9;a, : +++: dy] with 
n> k (or [ao;ay : ---] respectively). We set 


Pk = P;(ao, bare ak) and dk = Ox(ao, sai , Ak). 


In some sense the numbers p; and g,; may be considered partial numerators and par- 
tial denominators of rational numbers. Namely (by Proposition 1.13 below) these 
numbers are the numerators and denominators of k-convergents. If k = n, we get the 
numerator and the denominator of [ag;a1 : +++: an]. 

Let us prove several interesting facts and relations on the numbers p,; and q; (and 
hence that they are true numerators and denominators for k = n). In Proposition 1.12 
below we prove that the integers p; and gq, are relatively prime for every k. Then in 
Proposition 1.13 we prove an important recurrence relation on both numerators and 
denominators, which we will use in several further chapters. After that we deduce 
one more useful relation (see Proposition 1.15) and prove Theorem 1.16 on the 
convergence of infinite continued fractions. 


For the next several propositions we use some additional notation: 


Px = Py_1(a1,...,ax) and Gk = Ox—1(@1,---, x). 
We start with the following lemma. 


Lemma 1.11. The following holds: 


Pk =40PK+4k; 
dk = Px: 
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Proof. Since 


T= [aj3a2 +++: ag] 
qk 
we get 
Pe Ls 0 P+ 4K 
=dr=z x 7m é 
Ik Pr/Qk Px 
Therefore, 


Gk = A Px. 


Now we rewrite the second condition for polynomials P, and Q;: 


ie =A (aopr+ 4), 


1=P,(0,...,0) + O;(0,...,0) 
=A (aoPr_1(0,...,0) + Qg_1(0,...,0)) HA P_1(0,...,0). 


It is clear that the last expression represents a positive integer that is divisible by A. 
Hence | is divisible by A, and therefore A = 1. 


Proposition 1.12. Let (ao; : ---: a] be a continued fraction with integer elements. 
Then the corresponding integers p, and qx are relatively prime. 


Proof. We prove the statement by induction on k. 
It is clear that pp = ao and qo = | are relatively prime. 
Suppose that the statement holds for k—1. Then p,; and q, are relatively prime by 
the induction assumption. Now the statement holds directly from the equalities of 
Lemma 1.11. 


Let us now present a recursive definition of partial numerators and denominators. 
Proposition 1.13. For every integer k we get 
Pk = 4kPk-1 + Pk-2; 
Ik = Gk-1 + Fk-2- 


Proof. We prove the statement by induction on k. 
For k = 2 the statement holds, since 

By oe and ane 

go 1 q1 ay 


aga; +1 


x 


and therefore 
P2 _ 42+a041d2+d90 _ 42pit Po 


Q ajay +1 arqit+qo 


Suppose the statement holds for k— 1. Let us prove it for k: 
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ee eer eee _ 40 (4k PK-1 + Pe—-2) + an Ge—1 + Ge-2 
dk Pr/Gk ae as ay Pr—1 + Pe-2 
* ax (AoPr—1 + 4x—1) + (GoPe—2 + Fx-2) _ UKPk-1 t+ Pk-2 
An PR—1 + P-2 AGk—1 + Gk-2 


(The last equality holds by Lemma 1.11.) Therefore, the relations of the system 
hold. 


From Proposition 1.13 it follows that the partial numerators and denominators are 
bounded by the Fibonacci numbers F; (defined inductively as follows: F; = Fy = 1, 
and F, = F,-1 + F,-2). 


Corollary 1.14. For regular continued fractions the following estimates hold: 
Pel > Fe and qx > Fist. 
Proof. We prove the statement by induction on k. Direct calculations show that 


Pol >9, |pil=1, and |p2|>1, 
lgol=1, lgilJ=1, and |q2|>2. 


Suppose that the statement holds for k— 2 and k — 1, we prove it for k. Notice 
that the g, are all positive and the p, are either all negative or all nonnegative. We 
apply Proposition 1.13: 


|Pk| = 4k|Pk—1| +|Pe-2| = 1 -Fe-1 + Fe-2 = Fi 


and 
Dk = UGk—-1 + Ok-2 2 1 Fe + Pee = Fey. 


This concludes the proof. 


In the next proposition we present another useful expression for partial numera- 
tors and denominators. 


Proposition 1.15. For every k > 1, the following holds: 


Pk-1 pe _ (1 
Gk-1 sk Gk-19k 


Proof. Let us multiply both sides by gx qx. We get 


Pk-19k — PeGe—1 = (-1)*. 


We prove this by induction on k. 
For k = 1 we have 


P0d1 — P19o = 4149 — (jag +1) = —1. 
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Suppose the statement holds for k — 1. Let us prove it for k. By Proposition 1.13 
we get 


Pk-19k — PkOk—1 = Pk—1(Gk9k—1 + G2) — (Ge PR—-1 + Pk—-2) Uk-1 
= —(Pk-29k-1 — Pk-19k-2) = (-1)*. 


Therefore, the statement holds. 


Now we are ready to prove the following fundamental theorem. 


Theorem 1.16. The sequence of k-convergents of an arbitrary regular continued 
fraction converges. 


Notice that the theorem does not always hold for continued fractions with arbi- 
trary real elements. 


Proof. Let [ag;a1 : ---] be an infinite regular continued fraction. From Proposi- 
tion 1.15 and Corollary 1.14 we have 


ss 1 
Pk-1 _ Pk < 
Ge-1 4k | Peet 
Since the sum 
y 1 
on PP e+ 
converges (we leave this statement as an exercise for the reader), the sequence (2) 


is a Cauchy sequence. Therefore, (P 


) converges. 


1.3 Continuants 


Let us briefly mention a general approach to partial numerators and partial denomi- 
nators via continuants. 


Definition 1.17. Let 1 be a positive integer. A continuant K, is a polynomial with 
integer coefficients defined recursively by 


K_() =0; 

Ko) = 1; 

K\(a1) =a; 

Kn(a1,42,---,;An) = 4nKn—1(a1,42,.--,An—1) + Kn-2(a1,€2,---,An—2). 


Remark 1.18. Directly by construction of continuants we have 


Py, (ao, -+-;4n) = Kn+1(@0,41,42,---,4n) and Qn(ao,...,4n) =Kn(a1,a2,...,dn). 
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Therefore, 
Kn41(d0,41,--»,4n) 
Kn(a1,42,---;4n) 


= [ao3a) 2 +++: dl. 


1.4 Existence and uniqueness of a regular continued fraction for 
a given real number 


In the next theorem we show that for every real number there exists a regular con- 
tinued fraction representing it. For an irrational number, the corresponding regular 
continued fraction is unique and infinite. For a rational number there are exactly two 
continued fractions, 


[a0341 +++ dy] = [ao;ay : +++: a,—1: 1], 


with one of them odd, the other even. 


Theorem 1.19. (i) For every rational number there exist a unique odd and a unique 
even regular continued fraction. 

(ii) For every irrational number @ there exists a unique infinite regular continued 
fraction whose k-convergents converge to Q.. 


For instance, 2 = [1;3 : 2] = [1;3: 1:1] and w = [3;7:15:1:292:1:1:1:2: 
sal 


Proof. Existence. In Section 1.1 we have shown how to construct a regular contin- 
ued fraction [ao9;a1 : ---a,] for a rational number q@. Notice that if @ is rational but 
not an integer, then a, > 1, and therefore, 


O = |ao;a1 2 +++ 2 An] = [aos +++ Gn—1: I. 


One of these continued fractions is odd and the other is even. For an integer @ we 
always get & = [a] = [a@—1;1]. 

In the case of an irrational number a, the algorithm never terminates, generating 
the regular continued fraction a’ = [ag;a1 : a2 : ---] and the sequence of remainders 
rx > 1 such that 


a= [a3 ORG eel > rx]. 
Let us show that @ = a’. From Proposition 1.13 for [ao;a, : +--+: ag_1 : rg] and 
[a0;a1 +++: dg_, : +++] we have 
{fn + Pn— —1An + Pn— 
Pr eg aA Pn-2 aud Pn _ Pn-14n Pn 2 
Gn—-1'n 7 Qn-2 Qn Gn—-14n + Gn—-2 


Using these expressions and the fact that a, = |r, |, we have 
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Pel _|f Pn—19n—2 — Pn—2Gn-1) (Tn — Gn) 
(dn— no + @n—2)(Gn—14n + Gn—-2) 


1 


; UE In + An— 2)(Gn—14n + Gn—2) 
1 


< 
ae 19n F, ey 


A 


The last inequality follows from Corollary 1.14. 
Therefore, the sequence (2) converges to a, and hence @ = a’. 


Uniqueness. Consider a rational number a. Let us prove the uniqueness of the 


finite regular continued fraction & = [ag;a1 : +--+: dn], where a, 4 1. We prove this 
by reductio ad absurdum. 
Suppose 
aif ae ena ote ao, Sleps Z reer a ne RR a eee ee ea 
O, = (ag; 41 2 +++ 2 et Aggy +++ An] = [agjay 2 +++ tags Agyy toes On], 


where ay,., # a, ;. Then we have 


are / / 
— Pkt Pel PRe¢ 1 PR PKR 4 1 Pk-1 
_ a Pe / o J “ 
AKVKAL TT k-1 Velen 7 Uk-1 Dk" p 44 T Ik-1 


Therefore, rp41 = r4,, and thus ag41 = [resi] = Lyi) = a¢41. We arrived at a 
contradiction. 


The proof of uniqueness for continued fractions of irrational numbers is the same 
as in the case of rational numbers. 


1.5 Monotone behavior of convergents 


In this section we prove two statements on the monotone behavior of convergents. 
We start with a statement on the sequences of odd and even convergents. 


Proposition 1.20. (i) The sequence of even convergents (p2/q2x) is increasing, 
and the sequence of odd convergents (pox+1/G2K+1) is decreasing. 
(ii) For every real & and a nonnegative integer k we have 


Pak < oy and Pak+1 
q2k Q2k4+1 


>a. 


Equality holds only for the last convergent in case of rational Q. 


Proof. (i). By Proposition 1.13 we have 


Pm-2 Pm = (22 fat (2 Pa) _ ay" ( 1 1 ) 
Am-2 dm dm-2 dm-1 dm-1 dm dm-1 Gm-2. dm 
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Since the sequence of the denominators (q;) is increasing (by Proposition 1.15), we 
have that pm—2/qm—2 is greater than pm/qm for all even m, and it is less for all odd 
m. This concludes the proof of (i). 


(ii). The sequence of even (odd) convergents is increasing (decreasing) and tends 
to & in the irrational case, and we end up with some p,/qy = @ in the rational case. 
This implies the second statement of the proposition. 


In the second statement we show that the larger k is, the better a k-convergent 
approximates . 
Proposition 1.21. The sequence of real numbers 
ja — Al, C=O acs, 
qk 


is strongly decreasing, except for the case of & = |ag,1,1], where this sequence 
consists of the following three elements: (1/2,1/2,0). 


Proof. Recall that r, denotes the reminder [ag+1;ax42 :---]- 


Let us first prove that 


jar |or]| > Jo — 24] 
1 
This inequality is equivalent to 
+ > jao+ + : 
ag + ———— —-a a a ; 
e a, +1/ro : m a, +1/ro ay 
which is equivalent to 
1 1/r2 


> , 
a, +1/r2 ayi(a; +1/r2) 
and further to 
airy > 1. 


The only case in which ajr2 < 1 is a} =z = 1. This is exactly the exceptional case 
mentioned in the formulation of the theorem. 


Now we proceed with the general case of k > 2. From Proposition 1.13 we have 


_ 1 
la Pk “| = |Pk—19k—2 — 9k—-1Pk-2 = (1.2) 
Ik-1 Gk-1(Gk-1Fk +42) Mk-1(Gk-1k + 4-2) 
Similarly, we have 
1 
la P ‘| = (1.3) 
gk! dk (Gere + dk-1) 


Let us estimate the expression on the right side of the last equality: 
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1 1 1 
< 


I (eT K+ + M1) ~ Gk Gk + 9-1) Ii (Ga—1k + 4k-2 +4451) (1.4) 


< < 
Gk (G@k—1(ak +1) + 9K—-2) ~ Ge(Ge—-1rk + 9k-2) > Ik—-1 (Ge—-1 kK + Gk-2) 


In the last inequality we used the fact that gx > q,x_1, which follows directly from 
Proposition 1.13 for k > 2: 


Dk = Fk-14k + Fk—2 > k-1- 


Let us substitute the first and the last expressions of inequality (1.4) by equali- 
ties (1.2) and (1.3). We get 


la PA) > la Peat 
qk 


qk-1 


Therefore, the sequence is strictly decreasing. 


1.6 Approximation rates of regular continued fractions 


It is interesting to compare advantages and disadvantages of continued fractions 
with respect to decimal expansions. It is clear that it is very easy algorithmically to 
add and to multiply two numbers if you know their decimal expansions, while for 
continued fractions these operations are hard. For instance, there is no algebraic ex- 
pression for the elements of sums and products via elements of summands or factors. 
Probably that is the reason why in real life, decimal expansions are widely known 
and continued fractions are not. On the other hand, quadratic irrationalities have 
periodic continued fractions, while their decimal expansions are not distinguishable 
from an arbitrary transcendental number (see Chapter 12). In addition, continued 
fractions give exact expressions for certain expressions involving exponents and 
logarithms (see Exercise 1.4 below). They play a key role in Gauss’s reduction the- 
ory for describing SL(2,Z) matrices (see Chapter 11). It turns out that convergents 
of continued fractions are good as rational approximations of real numbers. In this 
section we say a few words about the approximation rates of convergents. Later, in 
Chapter 14, we study questions related to best approximations of real numbers by 
rational numbers. 


Theorem 1.22. Consider the inequality 


c 
a 2 <a (1.5) 
Letc> a Then for every , the inequality has an infinite number of integer solu- 
tions (p,q). 


Let us reformulate the essence of Theorem 1.22. 
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Lemma 1.23. Theorem 1.22 is true if for every irrational a, there are infinitely 
many integer solutions (p,q) of 


ets 5 
q|— V5q 


Proof. It is clear that for rational a = p/g the integer pairs (np,nq) are solutions. 
In the irrational case it is enough to prove the theorem for c = ot Let us prove that 


the equality can occur at most twice. Suppose we have 


for some choice of signs. Then 
Dp ?p 1 ( 1 1 ) 
EO Ws EF ge 


Since | and 5 are linearly independent over Q, we have 


and |q| = |4l. 


Hence equality holds at most for two pairs of integers (p,q) and (—p, —q). 


Our next step in the proof of Theorem 1.22 is to give an estimate on the growth 
of denominators. 


Lemma 1.24. Consider a real number a with regular continued fraction |ag;a, : 
--+]. Let px/qx be its k-convergents. Then for an infinite sequence of integers we 
have 

Ik s 1+ V5 


d-1 iD. 


Proof. By Proposition 1.13 we have 


Gig, MD 
dk-1 qk-1 
Hence if a, > 2, then 
1 5 
8 Bp gps 
Wk-1 2 


Then 
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= > — 
Ge. 
h2 geo! dws 4 2 


Gk-1 1 1 145 


Therefore, at least one of every two sequential numbers gx_1/qx—2 and qx/qx—1 
satisfies the condition of the lemma, and hence there are infinitely many q,/qx—1 
satisfying the condition. 


Proof of Theorem 1.22. From Lemma 1.23 it is enough to show that for every irra- 
tional a there exist infinitely many pairs of integers (p,q) satisfying 


a-=|< 


_ V5 ¢ 


Consider two consecutive convergents. From Proposition 1.15 we have 


Pp | 1 


_ 1 
Gk-19k- 


Pk-1 Pk 
Gk-1 Ik 


In addition, from Proposition 1.20 it follows that @ is contained in the segment with 
endpoints px_1/qx—1 and px/qx. Hence if the distance between endpoints is small 


enough, namely 
1 1 1 


< a, ; 
g-19k 5g, V5G% 


then either (px_1,qx-1) or (px,gx) is a solution of the above equation. The last 
inequality is equivalent to the following: 


(4) va() Lasso: 
H14+/5 


The quadratic polynomial on the left side has two roots: ==>. Hence the inequal- 
ity holds when 


HK sy l+Vv5 

M-1 2. 
From Lemma 1.24 we have infinitely many k satisfying the last inequality. There- 
fore, the number of solutions of the above inequality is also infinite. This concludes 
the proof of Theorem 1.22. 


It turns out that for certain numbers the estimate of the approximation rate cannot 
be essentially improved. 


Proposition 1.25. Let a be the golden ratio, i.e., 


14V5 
2 


=([l;l:1:1:1:---] 


Ifc< a then inequality (1.5) has only finitely many solutions. 


1.7 Exercises 19 


Denote the golden ratio by @ and its conjugate (1 — 5) /2 by 0. 


Proof. First of all, let us show that it is enough to check only all the convergents 
Px/qx- From Theorem 1.16 it follows that best approximations are convergents to a 
number. Let p/g be a rational number such that gx <q < x41. Therefore, 
ele 2 zee Mzalo—| 
q qk 
Hence if p/gq is a solution of inequality (1.5), then p;/g, is a solution of inequal- 
ity (1.5) as well. Therefore, if there are infinitely many solutions of inequality (1.5) 
then there are infinitely many convergents to the golden ratio among them. 
Second, we prove the statement for the convergents. From Proposition 1.13 it fol- 


lows that the k-convergent to the golden ratio equals Fy) /F;. Recall Binet’s formula 
for Fibonacci numbers via the golden ratio and its conjugate: 


(for more details see [221]). We have 


lo Pk-1 =| Fey C k+l _@kt! | a | 13% 
qk-1 Fi ok 6" ato" | | (ak—@")” 
1 Halk 
= a5 |1- 0". 


V5F; 
Since |O| < 1, we have |1 6" | = 1+ 0(1) and hence 


1 1 
“ath 
V5q%-1 qi-1 


CU Pk-1 
kl 


This implies the statement for convergents and concludes the proof of Proposi- 
tion 1.25. 


1.7 Exercises 


Exercise 1.1. Prove that for every k > 2 we get 


Pk-2__ Pk _ (—1)*a;, 
Gk-2— Ak WkUk—-2 


Exercise 1.2. Prove that for every k > 1 we get 
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a ree Seite 
—— = [agyag_1 t+ ay]. 
qk-1 
Exercise 1.3. Consider the convergents of a regular continued fraction. Prove that 
for every k > 0 we get 
1 


Gk (Gk+1 + 49k) 


1 
> | a Pk 
DkIk+1 qk 


Exercise 1.4. Prove that (a) V2 = [1; (2)]; 
(b) exp(1) = [2;1:2:1:1:4:1:1:6:1:1:8:1:1:10:---]. 


Exercise 1.5. Consider an irrational number @. 

(a) Suppose that we know that @ = 4,17. Is it true that te is its best approximation? 
(b) Find the set of all real numbers for which the rational number [1;2 : 3 : 4] is one 
of the best approximations. 


Exercise 1.6. Construct an infinite continued fraction with real coefficients that has 
exactly two limit points: | and —1. 


Check for 
updates | 


Chapter 2 
On Integer Geometry 


In many questions, the geometric approach gives an intuitive visualization that leads 
to a better understanding of a problem and sometimes even to its solution. In the next 
chapters we give an interpretation of the elements of continued fractions in terms 
of integer geometry, with the continued fractions being associated to certain invari- 
ants of integer angles. The geometric viewpoint on continued fractions also gives 
key ideas for generalizing Gauss—Kuzmin statistics to studying multidimensional 
Gauss’s reduction theory, leading to several results in toric geometry. 

The notion of geometry in general can be interpreted in many different ways. 
In this book we think of a geometry as of a set of objects and a congruence re- 
lation, which is normally defined by some group of transformations. For instance, 
in Euclidean geometry in the plane, we study points, lines, segments, polygons, 
circles, etc., with the congruence relation being defined by the group of all length- 
preserving transformations of the plane (this group is generated by all symmetries 
about the lines in the plane). The aim of this chapter is to introduce basic ideas of 
integer geometry. In order to have a general impression we start with the simplest, 
two-dimensional, case. We will need multidimensional integer geometry only in the 
second part of the book, so we describe it later, in Chapters 18 and 19. 

We start in Section 2.1 with general definitions of integer geometry, and in par- 
ticular, define integer lengths, distances, areas of triangles, and indexes of angles. 
Further, in Sections 2.2 and 2.3 we extend the notion of integer area to the case of ar- 
bitrary polygons whose vertices have integer coordinates. In Section 2.4 we formu- 
late and prove the famous Pick’s formula that shows how to find areas of polygons 
simply by counting points with integer coordinates contained in them. Further in 
Section 2.5 we show that there are only 6 distinct elementary integer-regular poly- 
gons in the plane. Finally, in Section 2.6 we formulate one theorem in the spirit of 
Pick’s theorem: it is the so-called twelve-point theorem. 
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Fig. 2.1 The following triangles are integer congruent. 


2.1 Basic notions and definitions 


2.1.1 Objects and congruence relation of integer geometry 


We consider the integer lattice Z* in R*. The objects of integer geometry are integer 
points with integer coordinates, integer segments and polygons having all vertices 
in the integer lattice, integer lines passing through pairs of integer points, integer 
angles with vertices at integer points. The congruence relation is defined by the 
group of affine transformations preserving the integer lattice, ie. Aff(2,Z). This 
group is a semidirect product of GL(2,Z) and the group of translations on integer 
vectors. We use = to indicate that two objects are integer congruent. 

The congruence relation is slightly different from that in the Euclidean case. We 
illustrate this with integer congruent triangles in Fig. 2.1. 


Remark 2.1. To avoid confusion we add prefixes for triangles and angles, writing 
AABC and ZABC respectively. 


2.1.2 Invariants of integer geometry 


As long as we have objects and a group of transformations that acts on the objects, 
we get invariants — quantities that are preserved by transformations of the objects. 
Usually the study of these invariants is the main subject of the corresponding ge- 
ometry. For instance, in Euclidean geometry, the invariants are lengths of segments, 
areas of polyhedra, measures of angles, etc. 

So what are the invariants in integer geometry? There are two different types of 
invariants in integer geometry: affine invariants and lattice invariants. Affine invari- 
ants are intrinsic to affine geometries in general. Affine transformations preserve 

— the set of lines; 

— the property that a point on a line is between two other points; 

— the property that two points are in one half-plane with respect to a line. 
Lattice invariants are induced by the group structure of the vectors in the lattice. The 
majority of lattice invariants are indices of certain subgroups (i.e., sublattices). 
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2.1.3 Index of sublattices 


Recall several notions from group theory. Let H be a subgroup of a group G. Con- 
sider an element g € G. The sets 


gH ={gh|heH} and Hg = {hg |he H} 


are called left and right cosets of H in G. 

The index of the subgroup H in the group G is the number of left cosets of H in 
G, denoted by |G: H|. (For example, for a positive integer n, we have |Z : nZ| =n.) 
Note that within this book we work with abelian groups Z‘, where for every g left 
cosets of g coincide with right cosets, i.e., gH = Hg. 


In integer geometry we consider the additive group of integer vectors Z”, which 
is called the integer lattice. A subgroup of an integer lattice is called an integer 
sublattice of the integer lattice. Actually, any integer sublattice is isomorphic to Z* 
for k <n, with the sublattice contained in some k-dimensional linear subspace of 
R”. The number k is the dimension of the sublattice. 

Let A be an integer point and G be an integer sublattice. We say that the set of 
points 

L={A+g|g¢€G} 


is the integer affine sublattice. Basically, the group G does not depend on the choice 
of the point A in L. The invariant definition for G is as follows: the lattice G is the 
set of all vectors with endpoints in L. 

Consider two integer affine sublattices L; C Lz whose integer sublattices G; and 
G» have the same dimension. It is clear that Gj C G2, so the index |G : Gi| is 
well defined. Since |Gz : G;| is an invariant of the pair (G1, G2) under the action of 
GL(n,Z), it is also an invariant of the pair (Z;,L2) under the action of Aff(n,Z). 


One of the simple ways to calculate the index of a two-dimensional sublattice in 
Z? is by counting integer points in a basis parallelogram for the sublattice. 


Proposition 2.2. The index of a sublattice generated by a pair of integer vectors v 
and w in Z? equals the number of all integer points P satisfying 


AP=av+Bw with 0O<a<1; 0<B<1, 
where A is an arbitrary integer point. 
Proof. Let H be the subgroup of Z? generated by v and w. Define 
Par(v,w) = {av+Bw|0<a,B < 1}. 


First, we show that for every integer vector g there exists an integer point P € 
Par(v,w) such that AP € gH. The point P is constructed as follows. Let 


g=Ayvt+)ow. 
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Consider 


P=(Ai—|Ai|)v+ (Ar—|Aa| w+. 
Since 
0<A,—-|A1J,A2—[A2] <1, 
the point P is inside the parallelogram and AP € gH. 


Second, we prove the uniqueness of P. Suppose that for two integer points 
P,P) € Par(v,w) we have AP; € gH and AP» © gH. Hence the vector P,P; is an 
element in H. The only element of H of type 


av+Bw with 0<a,B <1 


is the zero vector. Hence P; = P». 


Therefore, the integer points of Par(v,w) are in one-to-one correspondence to the 
cosets of H in Z?. 


Example 2.3. Consider the points A, B, and C as follows: 
NX 
"B A 


Then we have six points satisfying the above condition. Therefore, the index of the 
sublattice generated by AB and AC is 6. 


2.1.4 Integer length of integer segments 


Now we are ready to define several integer invariants. We start with integer length. 


Definition 2.4. The integer length of an integer segment AB is the number of integer 
points in the interior of AB plus one. We denote it by 1¢(AB). 


For example, the integer lengths of the segments AB and CD are both equal to 2. 


B 
ae (AB) =2, (CD) =2 
A > 


an alternative invariant definition is as follows: the integer length of an integer 
segment AB is the index of the sublattice generated by the vector B —A in the lattice 
of integer points in the line containing AB. 

In Euclidean geometry the length is a complete invariant of congruence classes 
of segments. The same is true in lattice geometry. 
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Proposition 2.5. Two integer segments are congruent if and only if they have the 
same integer length. 


Proof. Let AB be an integer segment of length k. Let us prove that it is integer 
congruent to the integer segment with endpoints O(0,0) and K(k,0). Consider an 
integer translation sending A to the origin O. Let this translation send B to an integer 
point B’ (x,y). 

Since 1¢(OB’) = 1€(AB) = k, we have x = kx’ and y = ky’, where x’ and y’ are 
relatively prime. Hence there exists an integer pair (a,b) such that 


bx’ —ay' =1. 


es 


is invertible, and therefore, it is in SL(2,Z). This matrix takes the segment OK to 
OB’, and thus OK © OB’ © OB. Hence all the segments of integer length k are 
congruent to the segment OK. Therefore, they are all integer congruent. 


Hence the matrix 


2.1.5 Integer distance to integer lines 


There is no natural simple definition of orthogonal vectors in integer geometry. Still, 
it is possible to give a natural definition of an integer distance from a point to a line. 


Definition 2.6. Consider integer points A, B, and C that do not lie in one line. An 
integer distance from the point A to the integer segment (line) BC is the index of the 
sublattice generated by all integer vectors AV, where V is an integer point of the line 
BC in the integer lattice. We denote it by 1d(A, BC). 

The points A, B, and C are collinear, we agree to say that the integer distance from 
A to BC is zero. 


One of the geometric interpretations of integer distance from a point A to BC is 
as follows. Draw all integer lines parallel to BC. One of them contains the point A 
(let us call it AA’). The integer distance Id(A, BC) is the number of lines in the region 
bounded by the lines AB and AA’ plus one. For the example of Fig. 2.2, we have the 


following: 
C 
Po asa ; 


There are two integer lines parallel to AB and lying in the region with boundary lines 
AB and AA’. Hence, Id(A,BC) = 241 =3. 


26 2 On Integer Geometry 


. oC. . e 
eet: ld(A, BC) =3 
B A~ 
. 3G . . ae . . e 
VAN —- \p + + IS(ABO)=6 
B A B . rn 
e Po e e e 
_ \; -_ la(BAC) = 2 
BR t ) ae e 


Fig. 2.2 A triangle AABC and the sublattices for calculation of 1d(A,BC), 1S(AABC), and 
lot(ZBAC). 


2.1.6 Integer area of integer triangles 


Let us start with the notion of integer area for integer triangles. We will define the 
integer areas for polygons later, in the next subsection. 


Definition 2.7. The integer area, denoted by 1S(AABC), of an integer triangle 
AABC is the index of the sublattice generated by the vectors AB and AC in the 
integer lattice. 

For the points A, B, and C in one line we say that 1IS(AABC) = 0. 


For instance, the triangle in Fig. 2.2 has integer area equal to 6. 

As in Euclidean geometry, integer area does not uniquely determine the congru- 
ence class of triangles. Nevertheless, all integer triangles of unit integer area are 
congruent, since the vectors of the edges of such triangles generate the integer lat- 
tice. 


2.1.7 Index of rational angles 


An angle is called rational if its vertex is an integer point and both its edges contain 
integer points other than the vertex. 


Definition 2.8. The index of a rational angle ZBAC, denoted by la(ZBAC), is the 
index of the sublattice generated by all integer vectors of the lines AB and AC in the 
integer lattice. 

In addition, if the points A, B, and C are collinear, we say that la(ZBAC) = 0. 
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Geometrically, the index of an angle ZBAC is the integer area of the smallest tri- 
angle AAB’C’ whose edges AB’ and AC’ generate the sublattices of lines containing 
the corresponding edges. For instance, the angle ZBAC in Fig. 2.2 has index equal 
to 2: 


C=C 


Let us conclude this subsection with a general remark. 


Remark 2.9. We write 1@, ld, 1S, and lo (starting with the letter “I’”’) to indicate that 
all these notions are well defined for any lattice, and not just for the integer lattice. 


2.1.8 Congruence of rational angles 


Let us formulate a simple necessary and sufficient condition of integer congruence 
of two integer angles. 


Proposition 2.10. Consider four integer points 
Ai =(pi,qgi), and B;=(ri,s;) fori=1,2. 


Then the angle ZA,OB, is integer congruent to the angle ZA,OBz if and only if the 
below matrix is in GL(2,Z): 


P2 ry Pi ry = 


7 — | 8c4(p2.92) gcd(r,82) ) | ged(pi,qi) ged(ri,s1) 
7 q2 52 7 SI 


gcd(p2,q2) gced(r2,52) gcd(pi,qi) ged(ri,51) 


Remark 2.11. If all the coordinates of the points in the above proposition are rela- 
tively prime to each other then 


TH & ) iC ais 
q2 $2 771 $1 
Proof of Proposition 2.10. Denote by A‘, and B' the nonzero integer points on the 
rays OA; and OB; closest to O (i = 1,2). Then the matrix T is the transition ma- 
trix from the basis (OA), OB‘) to the basis (OA4,OB’,). Hence the statement of the 


proposition follows directly from the definition of the integer congruence for the 
angles. 


Remark 2.12. To verify that a matrix is in GL(2,Z) one checks if the determinant 
of the matrix is equal to +1 and that all the elements of the matrix are integers. 


Example 2.13. Consider 
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A=(3,-4) and B=(3,2). 


Let us check if the angles ZAOB and ZBOA are congruent. We compute 


eee 3). = (a5): 


Although the value of the determinant of T is —1, one of the elements of T is not an 
integer. Therefore the angles ZAOB and ZBOA are not integer congruent. 


2.2 Empty triangles: their integer and Euclidean areas 


Let us completely study the case of the following ’smallest” triangles. 


Definition 2.14. An integer triangle is called empty if it does not contain integer 
points other than its vertices. 


We show that empty triangles are exactly the triangles of integer area 1 and Eu- 
clidean area 1/2. In particular, this means that all empty triangles are integer con- 
gruent (as we show later, this is not true in the multidimensional case for empty 
tetrahedra). 

Denote by S(AABC) the Euclidean area of the triangle AABC. Recall that 


S(AABC) = 5 | det(AB,A0)). 


(Here by det(v,w) we denote the determinant of the (2 x 2) matrix whose first and 
second columns contain the coordinates of the vectors v and w respectively.) 


Proposition 2.15. Consider an integer triangle AABC. Then the following state- 
ments are equivalent: 

(a) AABC is empty; 

(b) IS(AABC) = 1; 

(c) S(AABC) = 1/2. 


Proof. (a) = (b). Let an integer triangle AABC be empty. Then by symmetry, a 
parallelogram with edges AB and AC does not contain integer points except for 
its vertices as well. Therefore, by Proposition 2.2 there is only one coset for the 
subgroup generated by AB and AC. Hence, AB and AC generate the integer lattice, 
and 1S(AABC) = 1. 


(b) => (c). Let IS(AABC) = 1. Hence the vectors AB and AC generate the integer 
lattice. Thus any integer point is an integer combination of them, so 


(1,0) =AjAB+ AAC and (0,1) = AB+ py AC, 
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with integers A, , Ao, 111, U2. Let also 
AB = b,(1,0)+52(0,1) and AC =c (1,0) +c2(0,1) 


for some integers a),a2,b,,b2. Therefore, 


by Cl Mu Ui\ 10 
bo C2 Ag Lo ~\O1)° 
Since these matrices are integer, both their determinants equal either | or —1. Hence 


the Euclidean area of AABC coincides with the Euclidean area of the triangle with 
vertices (0,0), (1,0), and (0,1) and equals 1/2. 


(c) = (a). Consider an integer triangle of Euclidean area 1/2. Suppose that it has 
an integer point in the interior or on its sides. Then there exists an integer triangle 
with Euclidean area smaller than 1/2, which is impossible (since the determinant of 
two integer vectors is an integer). 


Remark 2.16. Proposition 2.15 implies that all empty integer triangles are congru- 
ent. 


2.3 Integer area of polygons 


In this section we show that every integer polygon is decomposable into empty 
triangles. Then we define the integer area of polygons to be the number of empty 
triangles in the decomposition. Additionally, we show that this definition of integer 
area is well defined and coincides with the definition of the integer area of triangles. 


Consider a closed broken line AgA,...Ay—1An (An = Ao) with finitely many ver- 
tices and without self-intersections in R*. By the polygonal Jordan curve theorem, 
this broken line separates the plane into two regions: one of them is homeomorphic 
to a disk and the other to an annulus. An n-gon (or just a polygon) A, ...Ay is the 
closure of the region homeomorphic to a disk. 


Proposition 2.17. (i) Every integer polygon admits a decomposition into empty 
triangles. 

(ii) Two decompositions of the same polygon into empty triangles have the same 
number of empty triangles. 

(iii) The number of empty triangles in any decomposition of a triangle AABC equals 
IS(AABC). 


Proposition 2.17 introduces a natural extension of the integer area to the case of 
integer polygons. 


Definition 2.18. The integer area of an integer polygon is the number of empty 
triangles in its decomposition into empty triangles. 
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Proof of Proposition 2.17. (i). We prove the statement for integer n-gons by induc- 
tion on n. 


Base of induction. We start with triangles. To prove the statement for triangles 
we use another induction on the number of integer points in the closure triangle. 

If there are exactly three integer points in the closure of a triangle, then they 
are the vertices of the triangle (since the triangle is integer). Hence this triangle is 
empty; the decomposition consists of one empty triangle. 

Suppose that every triangle with k’ < k integer points (k > 3) is decomposable 
into empty triangles. Consider an arbitrary integer triangle AABC with k integer 
points. Since k > 3, there exists an integer point P in the triangle distinct from 
the vertices. Decompose the triangle into AABP, ABCP, and ACAP (excluding 
triangles of zero area). This decomposition consists of at least two triangles, and 
each of these triangles has at most k—1 integer points. By the induction assumption 
each of these triangles admits a decomposition into empty triangles. Hence AABC 
is decomposable as well. 

Therefore, every integer triangle admits a decomposition into integer triangles. 


Induction step. Suppose that every integer k’-gon for k’ < k is decomposable into 
empty triangles. Let us find a decomposition for an arbitrary integer k-gon A; ...Ax. 
If there exists an integer index s such that As, As+1, As+2 are collinear then we just 
remove one of the points and reduce the number of vertices. Suppose that this is 
not the case. Consider the ray with vertex at A; and containing A,A> and start to 
rotate it in the direction of the vertex A3. We stop when we reach A3 (then A; = A3) 
or at the moment when the ray contains some vertex A; at the same half-plane as 
A, with respect to the line A2A3 for the first time. Now we decompose the poly- 
gon A,...Agz into polygons A,...As and AjA;...A,z. Both of these polygons have 
fewer than k vertices. By the induction assumption each of these polygons admits a 
decomposition into empty triangles. Hence A, ...A;z is decomposable as well. 

This concludes the proof of (i). 


(ii). Let a polygon P have two decompositions, one into n; empty triangles and the 
other into nz empty triangles. Then by Proposition 2.15 we have 


ny n2 


since Euclidean area is additive. Hence, nj = no. 


(iii). Consider an integer triangle AABC. Define 
Par(AB,AC) = {A+ av+Bwl0< a,B <1}. 


Denote by #(AABC) the number of empty triangles in the decomposition (by (i) and 
(ii) this is well defined). Let us prove that if IS(AABC) =n, then #(AABC) =n, by 
induction on n. 
Base of induction. Suppose that 1S(AABC) = 1. Then by Proposition 2.15 we have 
#(AABC) = 1. 
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Fig. 2.3 Integer area is additive. 


Induction step. Suppose that the statement holds for every k’ < k, for k > 1. Let 
us prove it for k. Consider a triangle AABC such that 1IS(AABC) = k. By Propo- 
sition 2.2, there exists an integer point P in the parallelogram Par(AB,AC) distinct 
from the vertices. Without loss of generality, we suppose that P is not on the edge 
AB. Set Q=P+AB and D=C+AB. 

The triangle ABQD is obtained from AAPC by shifting by the integer vector AB 
(see Fig. 2.3). Hence the total number of orbits of integer points with respect to the 
shift on vector multiples of AB in the parallelograms Par(AB,AP) and Par(PQ, PC) 
equals the number of such orbits in the parallelogram Par(AB,AC). Therefore, from 
Proposition 2.2, we get 


IS(AABC) = 1S(AAPB) +1S(ACPD). 
By Proposition 2.15 we have 


#(AABC) = 25(AABC) = S(ABCD) = S(ABOP) + S(CPQD) 
= 25(AABP) + 25(ACPD) = #(AAPB) +#(ACPD). 


Since P is not on AB, we have #(AAPB) < k and #(ACPD) < k. Therefore, by 
the inductive hypothesis, we get 


IS(AABC) = 1S(AAPB) +18(ACPD) = #(AAPB) +#(ACPD) = #(AABC). 


The proof is completed by induction. 


Corollary 2.19. The integer area of polygons in the plane is twice the Euclidean 
area. 


Proof. The statement holds for empty triangles by Proposition 2.15. Now the corol- 
lary follows directly from Proposition 2.17 and the definition of integer area. 


Remark 2.20. Corollary 2.19 implies that the index of an integer sublattice gener- 
ated by vectors AB and AC in the integer lattice equals 


| det(AB, AC). 
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2.4 Pick’s formula 


We conclude this section with a nice formula that describes a relation between inte- 
ger points in a polygon and its Euclidean area. 


Theorem 2.21. (Pick’s formula.) The Euclidean area S of an integer polygon sat- 
isfies the following relation: 
S=I+E/2-1, 


where I is the number of integer points in the interior of the polygon and E is the 
number of integer points in the edges. 


For the integer area one should multiply the right part of the formula by two. 


Example 2.22. For instance, for the following pentagon 


the number of inner integer points equals 4, the number of integer points on the 
edges equals 6, and the area equals 6. So, 


6=446/2-1. 


Proof. We prove Pick’s formula by the induction on the area. 
Base of induction. If the Euclidean area of an integer polygon is 1/2, then by Propo- 
sition 2.15 the polygon is an empty triangle. For the empty triangle we have 


S=043/2-1=1/2. 


Induction step. Fix k. Suppose that the statement holds for every integer polygon of 
area k’/2, where k’ < k. Let us prove the statement for polygons of area k/2. Let 
P be an integer polygon of area k/2. Then it can be decomposed into two integer 
polygons P, and P) (for instance as in the proof of Proposition 2.17 (i)) intersecting 
in an integer segment A;A;. Suppose that P; has J; interior points and E; boundary 
points. Let A;A ; contain E integer points. Since the areas of P; and P) are less than 
k/2, by the induction assumption induction we have 


S(P) = S(P,) +S(P:) = + £,/2-1+h+E,/2-1 
=(h+bh+E£)+(£,+E)—-2E -2)/2-1. 


Since the number of inner integer points of P is J; ++£ and the number of bound- 
ary integer points is Ej +E)—2E —2, Pick’s formula holds for P. This concludes the 
induction step. 
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We have formulated Pick’s theorem traditionally in Euclidean geometry. It is 
clear that this theorem is not true for all lattices. Still, the lattice analogue of the 
theorem holds for all lattices. 


2.5 Integer-regular polygon 


Let us first introduce several general notions from the theory of convex polygons. 
Recall that a convex polygon in the plane is the convex hull of a finite set of points 
that are not contained in one line. A point v of a polygon P is a vertex if for every pair 
of distinct to v points (v1,v2) of the polygon P the segment v; v2 does not contain v. 


Definition 2.23. A flag F = (v,@) is a collection of a point v and a line 2 satisfying 
the condition v € @. 

We say that a flag F = (v,) is a flag of a polygon P if the following three conditions 
hold: 

— the point v is a vertex of P; 

— the line @ contains another vertex of P distinct to v; 

— the polygon P is contained in one of the half-planes with respect to @. 


A polygon is integer if all its vertices are integer points. 
Now we are in position to define integer-regular polygons. 


Definition 2.24. An integer polygon is said to be integer-regular if for any pair of 
its flags there exists an integer affine transformation of the plane sending the first 
flag to the second one. 


Finally let us define elementary integer polygons. 


Definition 2.25. We say that a point v = (tx,ty) is a t-dilate of the point w = (x,y), 
we write v = tw. 

An integer polygon P; with vertices (tv1,...,fv,) is said to be the t-dilate of the 
polygon P) with vertices (v1,...,v,), denote it by P} =1P). 

An integer polygon is elementary if it is not the t-dilate of an integer polygon for 
every integer ft > 1. 


Despite the situation in the Euclidean geometry with regular n-gons for every 
integer n > 3, there are only finitely many elementary integer-regular polyhedra in 
the integer lattice case. They are all either triangles, or quadrangles, or hexagons. 
Let us list all of them. 


Proposition 2.26. Any elementary integer-regular polygon is integer congruent to 
one of the following 6 polygons: 
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We formally denote them by {3}%, {3}%, {4}%, {414, {6}4, and {6}% following 


Schléfli notation for the Euclidean case (for more details see Section 18.6). 


We refer to [100] for the proof of Proposition 2.26. 
Later in Section 18.6 we show the classification of the elementary integer-regular 
polyhedra in higher dimensions. 


2.6 The twelve-point theorem 


Let us also mention here an interesting theorem coming from the theory of toric 
varieties (see [66] and [117]; several interesting proofs of this theorem can be found 
in [181] and [185]). 

First we give a definition of a dual polygon. 


Definition 2.27. Let P = A, ...A, be a convex integer polygon. Suppose that O is 
the only integer point in the interior of P. Draw the integer vectors OB,,...,OB, of 
unit integer length and parallel to A,A2,...,A,A, respectively. The polygon P* = 
B,...By is said to be dual to P. 


The following statement holds. 


Theorem 2.28. (The Twelve-Point Theorem.) Suppose that P is a convex integer 
polygon containing a single integer point in its interior. Denote by E and E* the 
number of integer points in the boundary of P and P*. Then we have 


E+E* =12. 


We leave the proof of this theorem as an exercise for the reader. 
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2.7 Exercises 


Exercise 2.1. For any triangle AABC, show that 
1S(AABC) = 1€(AB)1d(C,AB). 


Exercise 2.2. Let ¢;, 2, and 3 be three parallel integer lines and let A; € J; be 
integer points. Suppose also that the lines ¢; and 3 are in different half-planes with 
respect to the line ¢j. Prove that 


1d(A1, €3) = Id(Ay , 2) +1d(Az, é3). 


Exercise 2.3. Geometric interpretation of integer distance. Let 1d(O,AB) =k. 
Prove that there are exactly k — | integer lines parallel to AB such that the point O 
and the segment AB are in different half-planes with respect to these lines. 


Exercise 2.4. Find an example of two integer noncongruent triangles that have the 
same integer area. 


Exercise 2.5. Consider an empty tetrahedron in R? with vertices in Z>. Suppose 
that it does not contain other points of the lattice Z>. Is it true that the edges of the 
tetrahedron generate the whole lattice Z?? 


Exercise 2.6. For any triangle AABC, show that 
1S(AABC) = 1€(AB) 1¢(BC) la(ZABC). 


Exercise 2.7. Prove that the index of a subgroup generated by v and w in the integer 
lattice is the absolute value of det(v, w). 


Exercise 2.8. Find a proof of the twelve-point theorem. 


Exercise 2.9. Classify all integer-regular polygons (namely, prove Proposition 2.26). 


Check for 
updates | 


Chapter 3 
Geometry of Regular Continued Fractions 


Continued fractions play an important role in the geometry of numbers. In this chap- 
ter we describe a classical geometric interpretation of regular continued fractions in 
terms of integer lengths of edges and indices of angles for the boundaries of convex 
hulls of all integer points inside certain angles. In the next chapter we will extend 
this construction to construct a complete invariant of integer angles. For the geom- 
etry of continued fractions with arbitrary elements see Chapter 15. 


3.1 Classical construction 


Let us start with the classical geometric construction of continued fractions. 

Consider an arbitrary number a > 1. Denote a ray {y = ax | x > 0} by rq. This 
ray divides the first quadrant { (x, y)|x,y > 0} into two angles. Consider one of them 
and take the convex hull of all the integer points except the origin inside this angle. 
The boundary of the convex hull is a broken line, which is called the sail of the 
angle. The same construction is applied to the second angle. See Fig. 3.1 for an 
example when @ = 7/5. 

If @ is a rational number, then both sails of the angles consist of finitely many 
segments and two rays, as in the example of a@ = 7/5. Denote the vertices of the 
broken line in the sail of the angle containing (1,0) by Ao,...,A, starting with Ag = 
(1,0). In the same way we denote the vertices of the broken line in the sail of the 
angle containing (0,1) by Bo,...,Bm, starting with By = (0, 1). We call these broken 
lines the principal part of the sails. As we will show later, m =n —1 orm=n. 

In the case of an irrational a, each of the sails is a union of one ray and an infinite 
broken line. Denote the broken line starting from (1,0) by AoA, ..., and the broken 
line starting from (0,1) by BoB, ... respectively, and call them principal parts of the 
sail. 

It is interesting to note that in the case of @ = 7/5, shown in Fig. 3.1, the ratios 
of the coordinates of extremal points of the convex hulls 
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Fig. 3.1 The sails for 7/5. 


Ai =(1,1), B, = (2,3), By = Az = (5,7) 


are convergents to the number 7/5. This does not happen by chance for 7/5; such a 
situation is typical. In the next theorem we show a general relation between A; and 
B; and the convergents for an arbitrary real number a > | (we discuss the case of 
0 <a <1 later, in Theorem 3.5). 


Theorem 3.1. Consider a > 1. Let AgA,A2... and ByB,B2... be the principal parts 
of the corresponding sails (finite or infinite). Then 
Aj = (gai-2,P2i-2) and B;= (gai-1,pai-1), i=1,2,... , 


where px/qx are convergents. The only exception is for the rational case for the 
following reason: the last vertices of the principal parts for both sails coincide with 
(Gn; Pn), Where Pn/qn is the last convergent, i.e., & = Pn/ qn. 


Remark 3.2. In terms of continuants the expressions of Theorem 3.1 where a@ = 
[a9;a1 : ---] is the regular continued fraction are as follows: 


Aj = (K2i-2(a1,---,@2i-2), K2i-1(Go,a1---,@2i-2)) and 
Bi = (Koi-1(a1,---,@2i-1),K2i(ao,a1 ---,€2i-1)), 
i=1,2,... (with a similar exception for the last value for rational @). 
We start the proof with the following lemma. 


Lemma 3.3. (a) The segment with endpoints (q2x-2, Pox—2) and (42x, P2x) is in the 
sail AjA\A2.... 
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(b) The segment with endpoints (q2x-1, P2x-1) ANd (qoe41,P2K+1) is in the sail 
BoB, Bo.... 


Proof. We start with item (a). First, notice that both points (¢2,_2, p2x-2) and 
(2k, P2k) are in the convex hull of the sail including ApA1A2..., since 


gee and on > Pak 


q2k-2 q2k 


Consider the line / passing through the points (¢2,_2, p2x-2) and (q2x, p2x). From 
Proposition 1.13 we know that all integer points on / are as follows: 


(G2k—2, P2k-2) +A(qox-1,Pp2x-1), A EZ. 


Let us prove that the line / is at unit integer distance to the origin. The inte- 
ger distance is equivalent to the index of the sublattice generated by the vectors 
(q2k—2, P2k—2) and (qok—1, P2k-1). From Proposition 1.15 it follows that 


|P2k—-292k—1 — P2k-192k-2| = 1, 


i.e., the Euclidean area of the corresponding triangle is 1/2. Hence by Proposi- 
tion 2.15 these vectors generate the lattice, and Id ((0,0),/) = 1. 

Therefore, there is no integer point in the interior of the band between / and the 
line parallel to / and passing through the origin. 


It is clear that the point (q2x_2, Pox-2) — (G2x—1, P2k-1) has nonpositive coordi- 
nates, and therefore it is not in the cone. 
Let us show that the point 


(G2k—25 P2k—2) + (ox + 1) (qax—1, P2k-1) = (42k; Poe) + (G2K—1, P2k-1) 


is not in the cone. Notice that the point (¢2,, pox) + (G2k—1, P2k-1) belongs to the 
segment with endpoints 


(Gox—1,P2k—-1) aNd (G2e41, P2K+1) = (G2k—15 P2k—1) + 42Kk-+41 (92k, P2k) 


which is contained in the other cone of the quadrant. 
Therefore, the segment with endpoints (q2x_2, pax—2) and (ox, p2x) is contained 
in the sail AjA;A2... (see Fig. 3.2). 


The proof for the second item is similar. 


Proof of Theorem 3.1. Actually, Lemma 3.3 almost proves the theorem. We have to 
check only the endpoints of broken lines. 

First, note that Ay = (1, ||), i-e., Ay = (go, Po). 

Second, (if a ¢ Z), we have 


By = (0,1) +ai(1, [@J) = (41, P1). 
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Fig. 3.2. The segment with endpoints (q2x_2, pox—2) and (q2x, prx) is in the sail. 


Finally, we have to check in the case of rational a@ whether the segment with 
endpoints (qn—1,Pn—1) and (qn,Pn) for the last two convergents is in one of the 
two sails. The point (q,—1,Pn—1) is in one of the two sails by Lemma 3.3. The 
point (qn,Pn) is in the intersection of the sails. The last thing we have to show 
is that the triangle with vertices (0,0), (Pn—1,Gn—1), and (qn, Pn) is empty. From 
Proposition 1.15 it follows that the Euclidean area of the triangle is 1/2; hence 
by Proposition 2.15 the triangle is empty. Therefore, the segment with endpoints 
(Gn—1;Pn—1) and (gn, Pn) is in the sail. 


We have found all the segments of the principal parts of both sails, concluding 
the proof. 


Remark 3.4. In the case of rational @ with the principle parts of the corresponding 
angles A; ...A, and B,...Bm, we have the following. If the last element of the odd 
continued fraction is 1, then n = m-+ 1, otherwise n = m. 


Let us formulate a similar theorem for the case 0 < a < 1. 


Theorem 3.5. Consider 0 < a < 1. Let AgAjA2... and ByB,B2... be the principal 
parts of the corresponding sails (finite or infinite). Then 


Aj = (q2i,P2i) and Bj; = (qzi-1,pri-1), i=1,2,... 


where px/qx are convergents. The only exception is when @ is rational, in which 
case the last vertex of the principal parts for both sails coincide with (qn, Pn), where 
Pn/Qn is the last convergent, i.e. & = Pn/ Qn. 


Proof. The proof repeats the proof of Theorem 3.1 except for the following differ- 
ence. Since ag = 0, the point Ag should coincide with A. This is the explanation for 
the shift in indices. 
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Remark 3.6. Similarly to Theorem 3.1 we have the following continuant expression 
for Theorem 3.5: 


A= (Koi(a1,..-, 42), K2i41(0,a1 wire ,a2;)) and 
B; = (Kai-1(a1,...,@2i-1), Kai(0,a1...,a2i-1)), 


i=1,2,..., where & = [0;a, :---] is the regular continued fraction (with a similar 
exception for the last value for rational a). 


3.2 Geometric interpretation of the elements of continued 
fractions 


Let us first formulate a corollary to Theorem 3.1. 
Corollary 3.7. Consider a > 1. Let AgA,\A2... and ByB,B2... be the principal 
parts of the corresponding sails (finite or infinite). Then 


1(AjAiv1) = a2; and 1¢(B;Bis1) =a241, i=0,1,2,... , 


where Ot = [ag;a1 : a2: ---]. The only exception occurs when @. is rational, in which 
case the last edges of the principal parts for both sails coincide with (qn, Pn), where 
Pn/Qn is the last convergent, i.€., 0 = Py/ Qn. The integer length of this edge is 1. 


In the case of 0 < @ < 1, the corollary also holds, the only difference being that 
10(AjAi+1) = aj+2 instead ap;. 


Proof. The corollary follows directly from the explicit formulas for A, and By of 
Theorem 3.1, after applying Proposition 1.13. 


Theorem 3.1 and Corollary 3.7 lead to an interesting algorithm for constructing 
sails for regular continued fractions. 


Algorithm to construct sails of real numbers. 


Input data. Given a regular continued fraction & = [ag;a, : a2: ---] with ag > 0. 
Set O = (0,0). 


Goal of the algorithm. To construct the sails for a. 


Step 1. Assign Ag = (1,0), Bo = (0, 1) and construct 


A; =Ao+aoO0By and further B, = Bo+a,OA\. 


Inductive Step k. Suppose that we have already constructed Ag...A, and Bo... Bg. 
Then put 
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Ay = Ao + agOBo By = Bo + a,OA, Ao = Ay + a,gOB, 


Fig. 3.3 Construction of the continued fraction for 7/5. 


Agy1 =Ag+42,OB, and Buy, = By t+ a2%41OAK41. 


Output. If a is rational, then the algorithm constructs both sails in finite time. If 
@ is irrational, then the algorithm also calculates both sails but in infinitely many 
steps. 


We leave the case 0 < @ < | as an easy exercise for the reader. 
In Fig. 3.3 we show the steps of the algorithm for the particular case @ = [1;2: 2]. 


3.3 Index of an angle, duality of sails 


In this subsection we show that there exists a duality between edges and angles. An 
important consequence of this duality is that all the elements of the regular continued 
fraction can be read from one of the sails. We restrict ourselves to the case @ > 1. 
Reduction to the case 0 < @ < 1 is straightforward, so we omit it. 


Theorem 3.8. Consider a > 1. Let AgA,A2... and ByB,B2... be the principal parts 
of the corresponding sails (finite or infinite). Then 


la(ZAjAj41Ai42) =a2)41 and 1a(2B;Bj+,Bi+2) = ari42 


for all admissible indices, where & = {ao3a1 : az: --:]. 


Proof. Let us calculate the index of an integer angle at some vertex of the principal 
part of one of the two sails. By Theorem 3.1 it is equivalent to calculate the index of 
the angle between a pair of vectors (q;-1, p;-1) and (qi41, Pi+1): 


lo (Z(qi-1, Pi-1) (0, 0) (gi41, Pi41)) = |Pi-19i41 — Pit 1Gi-1| = 
|Pi-1(@i4195 + 9i-1) — (@it1 pit Pi-1)Gi-1| = @i41|Pi-19i — Pigi-1| = Gi41. 
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y = 7/5x 


OY 


O 


Fig. 3.4 The edge—angle duality for 7/5. 


The second equality follows from Proposition 1.13, while the last follows from 
Proposition 1.15. O 


Edge-angle duality. From Corollary 3.7 and Theorem 3.8, we get that 
la(ZAjAj+1Ai42) =1¢(BiBiz1) and 1o0(ZB;Bj+1Bi+2) = 1e(Ai+1A4i42). 


The only exception is the very last angle (for rational slopes) where 1@ is smaller 
than the corresponding 1¢ by one. 
For the example of @ = 7/5 (see Fig. 3.1 and 3.4), we get 


la(ZA0A1A2) = 1¢(BoBi) = a1 = 2; 
lo (ZBoB,B2) =1¢(AjA2) —1l =a.—-1=1. 


Remark 3.9. We return to edge-angle duality a little later in the slightly different 
setting of LLS sequences (see Proposition 5.17). 


3.4 Exercises 


Exercise 3.1. Prove that every sail is homeomorphic to R!. 
Exercise 3.2. Describe the edge—angle duality for the casee0 <a <1. 


Exercise 3.3. Construct the sails of continued fractions for rational numbers 3/2, 
10/7, 18/13, and 13/18. 


44 3 Geometry of Regular Continued Fractions 


Exercise 3.4. Find geometrically all the convergents to the rational numbers 8/5, 
9/4, and 17/8. 


Exercise 3.5. Is it sufficient to know all indices for the angles in the principal parts 
of the sail defined by a > | and the indices of angles of its dual sail to reconstruct 
Q@ in a unique way? The question is interesting for both of the following cases: if a 
is rational and if & is irrational. 


Check for 
updates | 


Chapter 4 
Complete Invariant of Integer Angles 


In this chapter we generalize the classical geometric interpretation of regular contin- 
ued fractions presented in Chapter 3 to the case of arbitrary integer angles, construct- 
ing a certain integer broken line called the sail of an angle. We combine the integer 
invariants of a sail into a sequence of positive integers called an LLS sequence. From 
one side, the notion of LLS sequence extends the notion of continued fraction (see 
Remark 4.8), about which we will say more in the next chapter. From another side, 
LLS sequences distinguish the integer angles. Sails and LLS sequences of angles 
play a central role in the geometry of numbers. In particular, we use LLS sequences 
in integer trigonometry and its relations to toric singularities and in Gauss’s reduc- 
tion theory. F. Klein generalized the notion of sail to the multidimensional case 
to study integer solutions of homogenous decomposable forms. We will study this 
generalization in the second part of this book. 

In Section 4.1 we give a preliminary definition of an integer sine. Further, in 
Section 4.2, we define sails of integer angles and corresponding LLS sequences. 
Further, in Section 4.3 we prove that an LLS sequence is a complete invariant of an 
integer angle. We discuss angles sharing the same edge in Section 4.4. We conclude 
in Section 4.5 demonstration of two ways to compute the LLS sequence for a given 
angle. 


4.1 Integer sines of rational angles 


Recall the following general definitions. An angle is integer if its vertex is an integer 
point. If the integer angle ZABC has integer points distinct from B on both edges 
AB and BC, we call it rational. 


Let us introduce the notion of the integer sine function for rational angles. We 
put in the definition an integer analogue of the sine formula of the Euclidean case. 
In this section we consider an integer angle ZABC with integer vertex B. 
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Definition 4.1. Let ZABC be a rational angle, where A and C are lattice points 
distinct from B. The integer sine of the angle is the following number: 


IS(AABC) 
10(AB) 1€(BC)’ 


denoted by Isin ZABC. 


Remark 4.2. Notice that the integer sine is well defined, i.e., it does not depend on 
the choice of points A and C. We leave the reader to check this as an exercise. 


Let us briefly compare sin and Isin. One difference is in the multiplicative con- 
stant 1/2. This is due to the fact that an empty triangle has integer area 1 but Eu- 
clidean area 1/2, as shown in Proposition 2.15. Yet the difference between sin and 
lsin is much stronger, as illustrated in the following proposition. 


Proposition 4.3. The integer sine of a rational angle coincides with the index of the 
angle. 


In particular, this implies that the integer sine takes all possible nonnegative in- 
teger values. 


Proof. Consider a rational angle ZABC with A, B, C not contained on one line. Let 
A’ and C’ be the nearest integer points to B in the open rays BA and BC, respectively. 
Then from the definition of integer length, we get 


1¢(BA’) = 1¢(BC’) = 1. 
Hence, 
Isin(ZA’BC’) = 1S(AA‘BC’). 


The integer area of AA’BC’ is the index of the sublattice generated by BA’ and BC’ 
in Z*. Since the vectors BA’ and BC’ generate all integer points of the lines AB and 
BC, the integer area of AA’BC’ is equivalent to the index of the angle. Therefore, 
we get 

Isin(ZABC) = la(AABC). 


If A, B, and C are on one line, then 


Isin(ZABC) = la(AABC) =0. 


This concludes the proof. 


4.2 Sails for arbitrary angles and their LLS sequences 


In Chapter 3 we studied sails for a certain subset of integer angles. Let us extend the 
notion of sails to the case of arbitrary integer angles. 
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Fig. 4.1 An angle and its sail. 


Definition 4.4. Let a be an arbitrary integer angle. Consider the convex hull of the 
set of all integer points in @ except the origin. The boundary of this convex hull is 
called the sail of this angle. 


Remark 4.5. Notice that the sail of a lattice angle is a broken line homeomorphic to 
R. A sail is either a two-sided infinite broken line or a one-sided infinite broken line 
including also a ray at one side or a finite broken line that includes two rays at both 
sides. These rays appear when edges of angles contain integer points other than the 
vertex of the corresponding angle. 


Let us now define a very important characteristic of the sail that we will use in 
the future. 


Definition 4.6. Consider an arbitrary angle @ with integer vertex. Let the sail for 
this angle be a broken line with sequence of vertices (A;). Define: 


zy, = AA, 
ax—1 = Isin(ZAx_1AgAx +1) 


for all admissible indices. The lattice length sine sequence (LLS sequence for short) 
for the sail is the sequence (a,). Denote it by LLS(a@) 


The LLS sequence can be either finite or infinite on one or both sides. 


Example 4.7. Consider an example of the integer angle a centered at the origin O 
with edges passing through points A = (2,—1) and B = (3,2) respectively. Denote 
by C the point (1,0). Then the broken line ABC is the sail for a (see Fig. 4.1). Now 
we have 

1¢(AC) = 1, Isin(ZACB) =2, and 1¢(CB) =2. 


Hence 
LLS(q@) = (1,2,2). 


Remark 4.8. Notice that LLS sequences contain only positive integer elements. 
Consider a finite to the left LLS sequence. Regular continued fractions define a one- 
to-one correspondence between such LLS sequences and real numbers not smaller 
than 1; namely, the sequence (ao, a1,...) corresponds to the real number [ag;a1 : ---]. 
This continued fraction is an important invariant of integer angles (we will call it the 
integer tangent), and we will study this invariant in the next chapter. 
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4.3 On complete invariants of angles with integer vertex 


Proposition 4.9. Integer length, integer area, index and integer sine of a rational 
angle are invariant under the action of the group of integer affine transformations 


Aff(2,Z,). 


Proof. Every integer linear transformation sends subgroups to subgroups and the 
corresponding cosets to the corresponding cosets; hence all integer indices are in- 
variant under integer linear transformations. Thus, every integer affine transforma- 
tion preserves indexes of the corresponding integer lattice subgroups in Z? as well. 
Therefore, the integer area and the index (and hence the integer sine) of a rational 
angle are also preserved. The integer length is preserved, since all the inner integer 
points map to inner integer points. 


Corollary 4.10. The LLS sequence is an invariant of lattice angles with respect to 
Aff(2, Z). 


Proof. First, note that convex hulls are preserved by the elements of Aff(2,Z) (we 
leave this as an easy exercise for the reader). Then the statement follows directly 
from the fact that the integer length and the index are invariants of Aff(2,Z). 


It is interesting to note that the angles ZABC and ZCBA are not necessarily inte- 
ger congruent (i.e., there is no integer affine transformation taking one to another). 
For example, if we consider 


A= (3,—2), B= (0,0), and C= (2,1), 


then ZABC and ZCBA are not integer congruent. We will say more about such pairs 
of angles in the next section. 


Theorem 4.11. Two angles with vertices at integer points are lattice congruent 
if and only if they have the same LLS sequences (here we assume that the subse- 
quences of lengths and angles of one LLS sequence respectively coincide with the 
subsequences of lengths and angles of the other). 


Proof. From Corollary 4.10 we know that the LLS sequence is an integer invariant 
of angles. It remains to prove that if two LLS sequences coincide, then the corre- 
sponding angles are integer congruent. 

Consider two angles @ and £ with sails (A;) and (B;). Let the corresponding 
LLS sequences coincide and be equivalent to (a;). Let the vertices of the angles a 
and B be Og and Og respectively. Consider an affine transformation taking Og to 
Og, Bo to Ag, and B, to Ay. This affine transformation is integer, since 1¢(AgA1) = 
1¢(BoB1) = ao and Id(Og,AoA1) = Id(Og,BoB1). Suppose the angle f is taken to 
some angle y with sail (C;) (we already know that the vertex of y is Og, Co = Ao, 
and C; = A)). 

Let us prove that the broken lines AgA}A2... and CoC) C2... coincide by induc- 
tion. Suppose ApA, ...Axz_; coincides with CoC, ...C,_1. Let us prove that Ay = Cy. 
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First, we know that 
Isin(ZA,_2Ag—1Ax) = Isin(ZC,_2Cy_1 Cx) = arx—3 


and 
1(Ag_1Ax) = 1e(Cy_1 Cy) = arx_2. 


Hence we have 


1d(Ag,Ag—2Ag—1) = 1d (Cy, Ch—2Ck_-1) = 424-2423. 
This follows from the simple fact (see Exercise 4.6) that 


1S(ZABC) 


ld(A, BC) = “BO 


Notice that ZA,_2Ax_;Az and ZCy_2C,_1C; are two parts of convex sails, and 
hence the points A; and C; are on the other side of the point Og with respect to the 
line Ag_2Ax_1 = Cy_2Cy_1. Hence Ax and Cy are both on a line /; parallel to the line 
Ax_—2Ax_1 containing points at integer distance a2,_2d42,_3 from the line Ay_2Ax_. 

Second, we have 


Id(Ag, OgAn—1) = 1S(ZOgAg—1Ak) = G2x—1 = 1S(ZOaCK-1 Cx) = Id (Ck, OaCk-1), 


where OgAx_1 = OaCk_1. Again, by convexity we know that the points A; and C; 
are in a different half space from the point Ay;_2 = Cy_2 with respect to the line 
OgAx_1. Therefore, C, and A; are on a line /y parallel to OgAg_. 

Since OgA,_; and Ay_2Ax_ 1 are not parallel, the intersection of lines /; and /y is 
a point coinciding with both A, and Cx. 

Therefore, the broken lines AgA;A2... and CoC C2... coincide. 

The fact that ...A_,AgA; and ...C_,CoC, coincide follows from the consid- 
ered case after performing a GL(2,Z) transformation taking Ag to A; and A; to 
Ao, namely the transformation 

1 0 
(on) 


We apply this transformation to both sails (A;) and (C;) and get that the images of 
C, and A, coincide for every negative k. Therefore, the whole sails (A;) and (B;) 
coincide. 


Theorem 4.12. For every sequence of positive integers (odd finite or infinite on one 
or both sides) there exists an angle with vertex at the origin whose LLS sequence is 
this sequence. 


Proof. First we consider the case of a sequence do,a1,a2,... (odd finite or infinite 
to the right). Let A = (1,0), B = (0,0), and C = (1,q@), where [ag;a1 : az: ---] is 
the continued fraction for @. The angle ZABC has the desired LLS sequence. This 
follows directly from Corollary 3.7 and Theorem 3.8. 
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Fig. 4.2 The sails for @ and B coincide starting from some vertex. 


When the sequence is infinite to the left, we construct the angle ZABC for the 
inverse sequence. Then the angle ZCBA is the angle with the prescribed LLS se- 
quence. 

The remaining case is the case of both-side infinite sequences. Here we construct 
the angle for the part with nonnegative indices. Then apply to this angle the trans- 
formation we used in the proof of Theorem 4.11 and construct the angle for the 
remaining part of the sail and get the angle with the prescribed LLS sequence. 


4.4 Equivalent tails of the angles sharing an edge 


Let us prove the following remarkable proposition (see also in [3], pp. 24—25). 


Proposition 4.13. Let a > Q2 > 3 be distinct real numbers and let a be ir- 
rational. Consider two angles a and B defined by pairs of rays in the lines 
(y = Qx,y = Ox) and (y = x,y = 03x), where x > 0. Then the LLS sequences of 
these two angles coincide from some element (up to a sequence shift). 


Proof. Denote by ¥ the angle defined by rays (y = Qox,y = 03x), x > 0. It is clear 
that every integer point of the angle f is either in @ or in y. Then the convex hull 
of all integer points of B is the convex hull of the union of the convex hulls of all 
integer points for @ and y. Hence the sail for B is contained in the sails for @ and y 
except for one edge, which represents the common support line to the convex hulls 
for a and B (see Fig. 4.2) Therefore, the sails of a and B coincide starting from 
some vertex. Hence, the LLS sequences coincide as well. 


From duality reasons we immediately have the following statement. 


Corollary 4.14. Let d > GQ > Qs be distinct real numbers and let Qy be irrational. 
Consider two angles a and B defined by pairs of rays in the lines (y = x,y = bx) 
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and (y = x,y = 03x), where x > 0. Then the LLS sequences of these two angles 
coincide from some element (up to a sequence shift). 


4.5 Two algorithms to compute the LLS sequence of an angle 


In this section we describe two algorithms to write LLS sequences for a given ratio- 
nal angle. The first one requires a brute force study of rather small amount of cases. 
For the second method we provide explicit formulae for the LLS sequence in terms 
of coordinates of the integer vectors defining a given rational angle. 


4.5.1 Brute force algorithm 
Let us start with an algorithm that includes some brute force study of a curtain 
number of cases. 


Brute force algorithm to find the LLS sequence for an angle 


Input data. We are given an integer angle ABC. In particular we have the coordi- 
nates of the two integer nonzero vectors 


BA=(p,q), and CA=(r,s). 


Goal of the algorithm. To construct the LLS sequence of the angle ZABC. 


Step 1. First of all we compute primitive integer vectors on the edges BA’ = (p’,q’) 
and BC’ = (r’,s’) where 


(7,8) 
gcd(r,s)” 


ron (p,q) a hs 
Pat sedan) d ( ? ) 


Step 2. Secondly we compute integer sine of the angle by the formula 


ty! 
det ¢ ai 


(here we take the absolute value of the determinant). In the case where integer sine 
is 0 we have a degenerate angle (either 0 or 7) and we do not construct an LLS 
sequence for it. 


Isin ZABC = lsin ZA’BC’ = 


Step 3. Now let the integer sine be nonzero. Then 
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ZABC = larctan ey 3 


where k is one of the integer number satisfying 
ged(k,lsin ZABC) =1, and 1<k< sin ZABC. 


In this step we use brute force method to find the value of k by comparing A’BC’ 
with 
Isin ZABC 

k 


lsin ZABC 
k 


larctan 


for all admissible k. Here note that larctan 


is defined by the vectors 
vj =(1,0), and v2 = (Isin ZABC,k). 


There will be precisely one value of k that will give a positive test. 


Output. The sequence of elements of the odd regular continued fraction for the 


rational number 
lsin ZABC 


k 
is the LLS sequence for the angle ZABC. 


Example 4.15. Consider the angle @ = ZAOB with 
A=(10,4) and B= (12,21). 


Let us compute its LLS sequence. 
First of all, we find the primitive vectors parallel to OA and OB. They are 


OA' = (5,2) and OB’ = (4,7). 


54 
sa(54)]=27 


Now we need to check all values of 0 < k < 27 that are not divisible by 3. 


For k = 1: the vectors defining larctan(27/1) are (1,0) and (1,27) respectively. 
Hence the transition matrix between the angles is as follows: 


L1\ (54\7'_ (5/271/27 

0 27 27 ~\ -2 5 , 
It is clear that this matrix is not in GL(2,Z), and hence by Proposition 2.10 the 
angles are not integer congruent. The same happens for all admissible k except for 


k = 17. (We omit computations for the cases k 4 17.) If k = 17 then we compare 
with larctan(27/17). The transition matrix is as follows: 


Then we have 


lsina = 
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ea ; eax = és €GL(2,Z). 


Hence by Proposition 2.10 


a = larct eal 
= larctan —. 
17 


Note that the odd regular continued fraction for 27/17 is 


27 
11 12:3}. 
7 >| 
Therefore the LLS sequence for & is 
(1,1,1,2,3). 


4.5.2 Explicite formulae for LLS sequences via given coordinates 
of the angle 


It turns out that it is possible to write down an explicite expression for the elements 
of the LLS sequence for angles due to the following remarkable formula. 


Theorem 4.16. Consider two linearly independent integer vectors 
OA=(p,q) and OB=(rs) 


such that none of them are proportional either to (1,0) or to (0,1). Further let two 
sequences of integers 


(a0,41,---,@am) and (bo,a1,...,b2n) 


be defined as the sequences of elements of odd regular continued fractions of ratio- 
nal numbers 

* |q/p| and |s/r| if det(OA, OB) - sign 2 <0; 

+ |p/q| and \r/s| if det(OA, OB) -sign ? > 0. 


Set 7 
e= —sign? and 6 =sign-. 
q Ss 
Denote 
O = [Edom : Ed2m—1 +++ Ea, 2 €ag:0: dbo: 6b, : ++ : Sboa] 
and let 
|ot| = [coscy 2 -++ Cog] 


be the regular odd continued fraction for |Q|. Finally we set 
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° S=(co,C1,---,Cox) in case co £0; 
* S=(c2,...,C2~) in case co =0. 


Then S is the LLS sequence for the angle ZAOB. 
Remark 4.17. Currently we do not have all necessary tools for the proof of Theo- 


rem 4.16. We do it later in Subsection 15.2.4, when the LLS-sequence for broken 
lines is introduced. 


Remark 4.18. (A shorter formula that allows one to write integer tangents for 
the angles with irrational value of s/t.) One can simplify the computation of the 
continued fraction for 


O = [Edam : Ed2m—1 2 +++ Ea 2 Eg: 0: bb: bby: +++: Oba]. 
Here 
O = [Erm 2 Edym_1 1 +++ 1 EA, : Eag 10: 6E] 


where 
_ fq/p, if det(OA, OB) - sign & <0; 
om ee otherwise. 
It is interesting to note that the last expression gives an option to find integer tan- 
gent of the corresponding angle even if € is irrational, and the corresponding LLS 
sequence is infinite. 


Remark 4.19. For the completeness of the observation let us consider the cases of 
vectors (1,0) and (0,1). 


Example 4.20. As above in Example 4.15 we consider the angle @ = ZAOB with 
A=(10,4) and B= (12,21). 


Let us compute its LLS sequence using the techniques suggested by Theorem 4.16. 
Note first that 


10 1012\ 10 
eet) si = 7 162 = 405 > 0, 


hence we consider 10/4 and 12/21 respectively. We have 
10 12 
€=—sign— =—1 d 6=sign—=1. 
sign 7 an sign = 
Further we have 
10 5 12s 2. 
—=-=([1,1,2], and —===(0;1:1:2:1]. 
4 2 21 #7 
So the expression for the long continued fraction is as follows: 


27 
—1;-1:-2:0:0:1:1:2:1)/=—-—. 
[ lear 
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Let us now write the odd continued fraction for | — 27/17|: 


27 
——|=/1;1:1:2:3). 
-3]=" 
Since the first element of the continued fraction is not equal to zero (1 4 0) the LLS 
sequence for & is 
(1,1, 1,2,3). 


4.6 Exercises 


Exercise 4.1. Show that the integer sine of a rational angle ZABC does not depend 
on the choice of integer points B and C on the edges. 


Exercise 4.2. Prove that convex sets are taken to convex sets and their boundaries 
to boundaries under affine transformations. 


Exercise 4.3. Let A = (3,—2), B = (0,0), and C = (2,1). Prove that ZABC and 
ZCBA are not integer congruent. 


Exercise 4.4. Prove that if the principal parts of the sails of two angles are integer 
congruent, then the angles themselves are integer congruent. 


Exercise 4.5. Let d be a positive integer and / an integer line. Prove that all lattice 
points lying at integer distance are contained in two lines parallel to / at the same 
Euclidean distance from /. 


Exercise 4.6. Prove that 


ld(A, BC) = ——_ 


Exercise 4.7. Compute the LLS sequence for the angle AOB where O is the origin 
and the points A and B are as follows: 

(i) A = (5,7) and B = (11, —6); 

(ii) A = (1,0) and B = (4,-—7); 

(iii) A = (8,3) and B = (11,-1). 


Check for 
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Chapter 5 
Integer Trigonometry for Integer Angles 


In this chapter we explain how to interpret regular continued fractions related to 
LLS sequences in terms of integer trigonometric functions. Integer trigonometry 
has many similarities to Euclidean trigonometry (for instance, integer arctangents 
coincide with real arctangents; the formulas for adjacent angles are similar). From 
another point of view they are totally different, since integer sines and cosines are 
positive integers; there are two right angles in integer trigonometry, etc. In this chap- 
ter we discuss basic properties of integer trigonometry. In Chapter 6 we use integer 
trigonometric functions to describe angles of integer triangles, which will further 
result in global relations for toric singularities on toric surfaces (see Chapter 17). 

For rational angles we introduce definitions of integer sines, cosines, and tan- 
gents. In addition to rational integer angles, there are three types of irrational inte- 
ger angles. If an integer angle ZABC has an integer point distinct from the vertex B 
in AB but not in BC (in BC but not in AB), we call the angle R-irrational (or respec- 
tively L-irrational). In case both edges of an angle do not contain lattice points other 
than B, the angle is called LR-irrational. It is only for R-irrational angles that we 
have a definition of integer tangents. The trigonometric functions are not defined for 
L-irrational and LR-irrational angles. For more information on integer trigonometry 
we refer to [102] and [104]. 


5.1 Definition of trigonometric functions 


We start with the definition of the integer tangent for rational and R-irrational angles. 


Definition 5.1. Consider a rational or R-irrational angle ZABC. Let A, B, and C 
be noncollinear. Suppose the LLS sequence of ZABC is (ao,a1,a2,...) (finite or 
infinite). Then the integer tangent of the angle ZABC equals 


Itan(ZABC) — [40341 :a2: vee]. 
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When the points A, B, and C are collinear (but the points A and C are distinct from 
B) we say that Itan(ZABC) = 0. 


Consider a rational angle ZABC and recall that 


_ IS(AABC) 
Isin(ZABC) = WO(AB) (BC) 


Definition 5.2. For a rational angle @ we define 


It is clear that the integer sine, integer tangent, and therefore integer cosine are 
invariants of Aff(2,Z). 


Now we give the inverse function to the integer tangent. 


Definition 5.3. Consider a real & > 1 or @ = 0. The integer arctangent of a is the 
angle with vertex at the origin and edges 


{y =0|x>0} and {y = ax |x > O}. 


We define the zero angle as larctan0. The angle z is the angle ABC, where A = 
(1,0), B = (0,0), and C = (—1,0). 


5.2 Basic properties of integer trigonometry 


First we show that the integer tangent and arctangent are in fact inverse to each 
other. 


Proposition 5.4. (i) For every real s > 1, we have Itan(larctans) = s. 
(ii) For every rational or R-irrational angle a, the following holds: 


larctan(Itana@) = a. 


Proof. (i) From Corollary 3.7 and Theorem 3.8 we have that the elements of the 
LLS sequence for larctans coincide with the elements of the regular continued frac- 
tion for s. Hence the statement holds by definition of the integer tangent. 


(ii) Both angles larctan(Itana@) and @ have the same LLS sequences. Therefore, 
they are congruent by Theorem 4.11. 


In the following proposition we collect several trigonometric properties. 


Proposition 5.5. (i) The values of integer trigonometric functions for integer con- 
gruent angles coincide. 
(ii) The lattice sine and cosine of any rational angle are relatively prime positive 
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integers. 
(iii) For every angle o the following inequalities hold: 


Isina >lcosa@ and litana>1. 


Equality holds if and only if the lattice vectors of the angle rays generate the whole 
lattice. 

(iv) (Description of lattice angles.) Two integer angles a and B are congruent if 
and only if \tana = Itan B. 


Proof. (i) This is a direct corollary of the fact that integer sine and tangent are 
defined only by values of certain indices. 

(ii) By Proposition 5.4 it is enough to prove the assertion for angles larctana for 
rational ~ > 1. 

Consider * > 1, where m and n are relatively prime integers. Then the sail of the 
angle will contain the point (n,m). It is also clear that the lattice distance between 
the point (n,m) and the line y = 0 is m, and hence Isin (larctan “) = m. 

Since lItan (Jarctan m) = @ and Isin (Jarctan m\ =m, we have Icos (Jarctan m\ — 
n. 


(iii) This is true for all integer arctangent angles. Therefore, it is true for all angles. 


(iv) The LLS sequence is uniquely defined by the integer tangent. Therefore, the 
statement follows from Theorem 4.11. 


5.3 Transpose integer angles 


Let us give a definition of the integer angle transpose to a given one. 


Definition 5.6. The integer angle ZBOA is said to be transpose to the integer angle 
ZAOB. We denote it by (ZAOB)'. 


It immediately follows from the definition that for any integer angle @, we have 


Further, we will use the following notion. Suppose that some arbitrary integers 
a, b, and c, where c > 1, satisfy ab = 1(modc). Then we write 


a= (b(modc)) 


For the trigonometric functions of transpose integer angles the following rela- 
tions hold. 


Theorem 5.7. Trigonometric relations for transpose angles. Let an integer angle 
a be not contained in a line. Then 
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(1) If & = larctan(1), then a = larctan(1). 
(2) If ao F larctan(1), then 


Isin(a’) =Isina, —1cos(a’) = (Icos @(mod|sin @)) Pak 
Example 5.8. Let us check the statement of Theorem 5.7 for the simple case of 
a = larct i 
= larctan = 
5 


Since 10/7 = [1;2 : 2], the LLS sequence of a@ is (1,2,2). Hence the LLS sequence 
of the transpose angle is (2,2, 1). Now 


7 
a’ ~ larctan([2;2 : 1]) = larctan re 
In particular, 3-5 — 1 = 14 is divisible by lsina@ = 7, and hence we have 
Isin(a’) =7=Isina; Icos(a’) =3 = (5(mod7)) = (Icos @(mod|sina)) '. 


(See Fig. 5.1.) 


Proof. Consider an integer angle a. Let ltana = p/q, where gcd(p,q) = 1. By 
Proposition 5.4(ii), we have @ ~ larctan(p/q). The case p/q = | is trivial. Consider 
the case p/q > 1. 

Let A = (1,0), B = (g,p), and O = (0,0). Suppose that an integer point C = 
(q', p’) of the sail of the angle larctan(p/q) is the closest integer point to the end- 
point B. Both coordinates of C are positive integers, since p/g > 1. Since the triangle 
ABOC is empty and the orientation of the pair of vectors (OB,OC) does not coin- 
cide with the orientation of the pair of vectors (OA, OB), we have 


/ 
det (: 7) =-1. 
PP 


Consider a linear transformation € of the two-dimensional plane, 


Ee Ce )) 
P —4| 
Since det(€) = —1, the transformation & is integer-linear and changes the orienta- 
tion. Direct calculations show that the transformation & takes the angle larctan' (p/q) 


to the angle larctan(p/(p—p’)). (See the example in Fig. 5.1.) 
Since gced(p, p’) = 1 and p > p—p’, the following holds: 
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sires ea, 
9. j larctan(7/3) 
Dente ene tea 


Fig. 5.1 If @ is integer-congruent to larctan(7/5), then a is integer-congruent to larctan(7/3). 


Since gp'—pq' = —1, we have gp’ = —1(mod p). Therefore, 
Icos alcos(a') = q(p — p’) = 1(modp). 
From that we have 


Isin(ar') = Ising, 
Icos(a') = (Icos a(modIsin t)) *. 


This concludes the proof of Theorem 5.7. 


5.4 Adjacent integer angles 


Now we define an integer angle adjacent to a given one. 


Definition 5.9. An integer angle 7BOA’ is said to be adjacent to an integer angle 
ZAOB if the points A, O, and A’ are contained in the same straight line. We denote 
the angle ZBOA’ by n—ZAOB. 


For the trigonometric functions of adjacent integer angles the following relations 
hold. 
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Theorem 5.10. Trigonometric relations for adjacent angles. Let a be some inte- 
ger angle. Then one of the following holds: 
(1) If & is the zero angle, thenn-a = 1. 
(2) If & is the straight angle, thenn—a = 0. 
(3) If @ = larctan(1), then t—a@ = larctan(1). 
(4) If & is neither zero nor straight nor integer-congruent to larctan(1), then 
m—Q = larctan’ (tite) ; 
lsin(t#—a@) =Isin@, Icos(#—a) = (—1cos a(modlsin @)) me 


Example 5.11. Consider the integer angle 


larctan 228 
157° 


Then for the adjacent angle we have 


22 22 
lsin (x — arctan =) =225 and_Icos (x — arctan =) = 182, 


since 157-182-+ 1 is divisible by 225. This is precisely in accordance with Theo- 
rem 5.10. Hence 


225 ) 
m — larctan = larctan : 
157 182 
Remark 5.12. Suppose that an integer angle @ is neither zero nor straight. Then the 
conditions : 
Icos(#—a) = (—Icosa(modIsina)) , 
0 < Icos(a#—@) < Ising, 
uniquely determine the value Icos(7—@). 
Proof of Theorem 5.10. Consider an integer angle a. Directly from the definitions 
it follows that if a = 0, then m—a = 7, and, if a=7, then tT—a 0. 


Suppose that ltan a = p/q > 0, where gcd(p,q) = 1. Then by Proposition 5.4(ii) 
we have & = larctan(p/q). Therefore, 


m—O = m—larctan(p/q). 


It follows immediately that if p/q = 1, then m—a@ = larctan(1). 
Now let a # larctan(1), and hence p/g > 1. Consider a linear transformation & 
of the two-dimensional plane, 


Since det(&;) = —1, the transformation &, is integer-linear and changes the ori- 
entation. Direct calculations show that the transformation &, takes the cone cor- 


5.4 Adjacent integer angles 63 


Fig. 5.2 If @ is integer-congruent to larctan(7/5), then 2—@ is integer-congruent to larctan(7/4). 


responding to the angle 2—larctan(p/q) to the cone corresponding to the an- 
gle larctan'(p/(p—q)). Since & changes the orientation, we have to transpose 
larctan’ (p/(p—q)). (See the example in Fig. 5.2). Therefore, 


: ltan a 
m—Q = larctan | ——_——— ]}. 
Itan(a) — 1 
Now we show that 
Isin(a#—a@) = Ising, 
Icos(#—a) = (—Icos (mod |sin @)) ra 
Let A = (1,0), B = (gq, p), and O = (0,0). Consider the sail of the angle 
m—larctan(p/q), 


which is integer-congruent to 7—a@. Suppose that an integer point C = (q’, p’) of 
the sail for 7—larctan(p/q) is the closest integer point to the endpoint of the sail B 
(or equivalently, the segment BC is in the sail and has the unit integer length). The 
coordinate p’ is positive, since p/gq > 1. Since the triangle ABOC is empty and the 
vectors OB and OC define the same orientation as OA and OB, 
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/ 
det ic ) = 
PP 


Consider a linear transformation & of the two-dimensional plane, 


Since det(&) = 1, the transformation > is integer-linear and orientation-preserving. 
Direct calculations show that the transformation &) takes the integer angle 


m—larctan(p/q) 


to the integer angle larctan(p/(p—p’)). Since gcd(p, p’) = 1, we have Isin(7—a) = 
p. Since gcd(p, p’) = 1 and p > p—p’, the following holds: 


Icos(#—@) = Icos (tarctan( E -)) =p-p'. 
P—P 


Since gp'—pq' = 1, we have gp’ = 1(mod p). Therefore, 
Icos a lcos(#—a) = q(p — p’) = —1(mod p). 
From that we have 


Isin(a#—a) = Ising, 
Icos(a—@) = (—Icos a@(modIsin or) |. 


This concludes the proof of Theorem 5.10. 


Remark 5.13. The statements of Theorem 5.7 and Theorem 5.10 were first intro- 
duced in terms of regular continued fractions and so-called zig-zags by P. Popescu- 
Pampu [182]. 


The following statement is an easy corollary of Theorem 5.10. 


Corollary 5.14. For every integer angle a, the following holds: 


m—("%-Q) =a. 


5.5 LLS sequences for adjacent angles 


The formula for the LLS sequences of adjacent angles is a little more complicated 
than the formula for the LLS sequences of transpose angles. In this section we dis- 
cuss it. 
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Let us define a flip on the set of all positive integer sequences of odd length. 


Definition 5.15. Let s be a sequence of an odd number of positive integers. We say 
that st is an adjacency dual sequence to s, where s+ is defined as follows 


¢ Finite sequences: 


1)+ = (1); 
2) = (2); 

ie — (ljk=2,1): 
1,k—2,1)* = (k); 


1,41, 42, ..+5Qm,1)> = (Gm +1, Qm—1,-.+543,€2,a,+1); 
b,a1,Q2,-.-,4n,1)~ = (Qn tl, an_1,---,€2,a1,b—1,1); 
1,41,d2,-..,4n,c)~ = (1,e—1,an,...,a2,a,+1); 

= (1,c—1,ay,...,a2,a1,b—1, 1). 


1,a),€2,...)- =(...,@,a1 +1); 
b,a\,a2,...)- =(...,a2,a1,b—1, 1); 
...,4-2,a4-1,1)- = (a_i+1,a_2,...)5 
..+,4-2,4-1,€)~ = (1,c—1,a_1+1,a_2,...). 


¢ Both sides infinite sequences: 
L_ 
(...,4-2,4-1,40,41,42,...)~ = (...,42,41,40,4-1,4_2,...). 


Here we assume that the indexes m > 2 and n > 1, and the integer elements satisfy 
aj >0; b,c >2;k > 3. 


Remark 5.16. The adjacency duality is a duality, i.e., 


((s)4)t =s. 


Informally speaking the relation between s and s+ is evocative of the relation be- 
tween odd and even regular continued fractions, but applied to both ends of the 
sequence 5. 


Proposition 5.17. The LLS sequences of a rational angle and its adjacent angle are 
adjacency dual to each other. 


Example 5.18. Consider the integer angle 


Acces 225 
= larctan —. 
157 


Then according to Example 5.11 for the adjacent angle we have 
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ma—Qa=7 -—larct BoD si t ae 
=> arctan 157 — arias Eg 


The corresponding LLS sequences for @ and 7 — @ 
(1,2,3,4,5) and (1,4,4,3,3) (=(1,5-1,4,3,2+1)) 


respectively. We have checked Proposition 5.17 for the angle larctan 2S. 
Proof of Proposition 5.17. Consider an integer angle a. 


The cases when 
a<=larctank or QQ &2-—larctank 


for a positive integer k are straightforward and left for the reader to check. 
Now let the LLS sequence for @ be either 
(b,a1,---,4n,C) 
with b > 1 andc > 1, or 
(b,a1, a2, as = 


with b > 1. 

Then the sail of the transposed adjacent angle will start with two vertices (—1,0) 
and (0,1); and the direction of the second vector is (1,b). Therefore, the first two 
elements of (7 — a)’ are 


1é((—1,0), (0,1)) = 
Isin(Z(— POA, De bed 


Further elements of the LLS sequence of (7 — a)! are computed by edge-angle 
duality (see Corollary 3.7 and Theorem 3.8). 


Finally for the case of finite sequences it remains to say that the LLS sequence 
for 2 — @ is reversed to the LLS sequence for (az — a)‘. This concludes the proof for 
the case of @ having the LLS sequence 


(b,a1,...,4n,C) 


with b>1landc>1. 


The proof of the cases starting with an element | is similar, for that reason they 
are omitted here. 


The cases of LLS sequences bounded from the right is reduced to the considered 
cases by transposing all the angles involved. 


Finally the case of doubly infinite sequences is a limiting case of one side infinite 
sequences. Here we consider angles ZA,OB passing through vertices of the sail A; 
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tending to the ray OA. The LLS sequences of ZA;OB and x — ZA;OB asymptotically 
coincide with ZAOB and x — ZAOB respectively, as the ray OA; approaches the ray 
OA. (We return to doubly infinite sequences later, see Proposition 8.6. ) 


Remark 5.19. It is not a surprise that the behavior of the formula for the LLS se- 
quence of adjacent angles has a similar behavior on both its ends. In fact there is 
an informal reason for this symmetry. Indeed let @ = ZAOB and let us denote by 
A’ and B’ the points symmetric to A and B respectively with respect to the origin O. 
Then we have: 


ZAOB = a; 
ZBOA' =1—@; 
ZA'OB! = a. 


So in some sense the angle z — & is surrounded by two copies of a. That provides 
symmetric behavior of the LLS sequences. 


5.6 Right integer angles 


We define right integer angles by analogy with Euclidean angles, using their sym- 
metric properties. 


Definition 5.20. An integer angle is said to be right if it is integer-congruent to both 
its transpose and adjacent angles. 


It turns out that in integer geometry there exist exactly two integer inequivalent 
right integer angles. 


Proposition 5.21. Every integer right angle is integer-congruent to either larctan(1) 
or larctan(2). 


Proof. Let @ be an integer right angle. 
Since (7—0) 4 0 and (17-2) $ 7, we have ltana > 0. 
By the definition of integer right angles and Theorem 5.7, we obtain 


Icos() = Icos(a’) = (Icos @(mod|sina)) 


By the definition of integer right angles and Theorem 5.10, we obtain 


Icos(a@) = Icos(#—a) = (—Icos a@(modIsin ar)". 


Hence, 
(Icos o(mod Isin a) ie (—Icos (mod Isin a) ) ee 


Therefore, lsina@ = 1 or lsina@ = 2. 
The integer angles with lsin @ = | are integer-congruent to larctan(1). The integer 
angles with lsina = 2 are integer-congruent to larctan(2). The proof is complete. 
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5.7 Opposite interior angles 


First we give the definition of parallel lines. Two integer lines are said to be parallel 
if there exists an integer shift of the plane by an integer vector taking the first line to 
the second. 


Definition 5.22. Consider two distinct integer parallel lines AB and CD, where A, B, 
C, and D are integer points. Let the points A and D be in different open half-planes 
with respect to the line BC. Then the integer angle ZABC is called opposite interior 
to the integer angle ZDCB. 


We have the following proposition on opposite interior integer angles. 


Proposition 5.23. Two integer angles opposite interior to each other are integer- 
congruent. 


Proof. Consider two distinct integer parallel lines AB and CD. Let the points A 
and D be in distinct open half-planes with respect to the line BC. Let us prove that 
ZABC = ZDCB. 

Consider the central symmetry S of the two-dimensional plane at the midpoint of 
the segment BC. Let P be an arbitrary integer point. Note that S(P) = C+ PB. Since 
the point C and the vector PB are both integer, S(P) is also integer. Since the central 
symmetry is self-inverse, the inverse map S~! also takes the integer lattice to itself. 

Therefore, the central symmetry S is an integer-affine transformation. Since S 
takes the angle ZABC to the angle ZDCB, we obtain ZABC = ZDCB 


5.8 Exercises 


Exercise 5.1. Find both algebraically and geometrically the integer trigonometric 


functions for the adjacent and transpose angles to the angles i, larctan 3, larctan B. 


Exercise 5.2. Let @ be an arbitrary integer angle. Find expressions in the trigono- 
metric functions for the angles 2 — a’ and (x — a)’. Are these two angles integer 
congruent? 


Exercise 5.3. Can an integer triangle have two integer right angles? What about 
three integer right angles? 


Check for 
updates | 


Chapter 6 
Integer Angles of Integer Triangles 


In this chapter we study integer triangles, based on the results from previous chap- 
ters. We start with the sine formula for integer triangles. Then we introduce integer 
analogues of classical Euclidean criteria for congruence for triangles and present 
several examples. Further, we verify which triples of angles can be taken as angles 
of an integer triangle; this generalizes the Euclidean condition 7+ B + y= for 
the angles of a triangle (this formula will be used in Chapter 17 to study toric sin- 
gularities). Then we exhibit trigonometric relations for angles of integer triangles. 
Finally, we give examples of integer triangles with small area. 


6.1 Integer sine formula 


In this section we give the integer sine formula for angles and edges of integer 
triangles. 


Let A, B,C be three distinct and noncollinear integer points. We denote the integer 
triangle with vertices A, B, and C by AABC. 


Proposition 6.1. (The sine formula for integer triangles.) For any integer triangle 
AABC the following holds: 


1¢(AB) 1é(BC) 1é(CA) 1é(AB) 1é(BC) 1€(CA) 


Isin(ZBCA) — |sin(ZCAB) __|sin(ZABC) 1S(AABC) 


Proof. We have 


1S(AABC) = 1¢(AB) 1¢(AC) lsin(ZCAB) = 1¢(BA) 1€(BC) lsin(ZCBA) 
= 1(CB)1é(CA) lsin(ZACB), 


After inverting the expressions and multiplying by all three integer lengths, we get 
the statement of the proposition. 
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The following is an open problem. 
Problem 1. Find an integer analogue of the cosine formula for triangles. 


Recall that the cosine formula for Euclidean triangle AABC with a = |BC|, b = 
|AC|, c = |AB|, and a = ZBAC is 


a= b* +c* —2becosa. 


It seems that to write an integer cosine formula is a hard task. Currently there are no 
theorems related to addition in integer trigonometry. 


6.2 On integer congruence criteria for triangles 


We start with integer analogues for the three Euclidean critera of triangle congru- 
ence. It turns out that only the first criterion is valid in integer geometry. Further, we 
present an additional criterion of congruence: on three angles and the area. 


Proposition 6.2. (The first criterion of integer triangle integer congruence.) 
Consider integer triangles AABC and AA'B'C'. Suppose that 


AB@A'B', ACZA'C, and ZCAB& ZC'A'B’. 


Then AA'B'C' = AABC. 


Proof. By definition there exists an integer affine transformation taking ZCAB to 
ZC'A'B’. Since the integer lengths of the corresponding segments are the same, the 
transformation takes B and C to B’ and C’. 


It turns out that the literal generalizations of the second and third criteria from 
Euclidean geometry to integer geometry do not hold. The following two examples 
illustrate these phenomena. 


Example 6.3. The second criterion of triangle integer congruence does not hold 
in integer geometry. In Fig. 6.1 we show two integer triangles AABC and AA'B'C’. 
We have 
AB=A'B', ZABC = ZA'B'C' ~ larctan(1), 
and ZCAB ~ ZC'A'B’ = larctan(1). 


The triangle AA’B’C’ is not integer congruent to the triangle AABC, since 


IS(AABC) =4 and 1S(AA‘B‘C’) =8. 


Example 6.4. The third criterion of triangle integer congruence does not hold 
in integer geometry. In Fig. 6.2 we show two integer triangles AABC and AA'B'C’. 
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Fig. 6.2 A counterexample to the third criterion of integer congruence for triangles. 


(GE 


B! A’ 


Fig. 6.3 The additional criterion of integer congruence is not improvable. 


All edges of both triangles are integer congruent (of length one), but the triangles 
are not integer congruent, since IS(AABC) = | and 1S(AA’B’C’) = 3. 


Instead of the second and the third criteria, we have the following additional 
criterion. 


Proposition 6.5. (An additional criterion of integer triangle integer congru- 
ence.) Consider two integer triangles AABC and AA'B'C' of the same integer area. 
Suppose that 


ZABC = ZA'B'C', ZCAB= ZC'A'B’, ZBCA= ZB'C'A'. 
Then AA'B'C' = AABC. 


Proof. All the integer lengths of the corresponding edges are the same by the sine 
formula. Hence the triangles are integer congruent by the first criterion. 


In the following example we show that the additional criterion of integer triangle 
integer congruence is not improvable. 


Example 6.6. In Fig. 6.3 we give an example of two integer inequivalent triangles 
AABC and AA’B'C' of the same integer area 4 and congruent angles ZABC, ZCAB, 
and ZA’B'C’, ZC'A’B’ all integer-equivalent to the angle larctan(1), but AABC # 
AA'B'C'. 


Finally, we give a general formula for integer congruence of two triangles. 
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Proposition 6.7. Consider two integer triangles A,B,C, and AA2B2C)2 with 
BiAj = (Pisdi)s and BC; => (ri, Si) fori= 1,2. 


Then these triangles are vertex-to-vertex integer congruent (i.e., there exists a 
GL(2,Z) transformation sending A, to Az, By to Bz, and C, to C2) if and only if 
the below matrix is in GL(2,Z): 


T= & >) G i, 
q2 $2 qi $1 
Proof. Note that the matrix T is the transition matrix from the basis (B,A;, B,C)) to 


the basis (B2A2,B2C2). Now the statement of the proposition follows directly from 
the definition of the integer congruence for the triangles. 


6.3 On sums of angles in triangles 


In Euclidean geometry a triple of angles is a triple of angles in some triangle if and 
only if their sum equals 7. Let us introduce a generalization of this statement to the 
case of integer geometry. 

First, we reformulate the Euclidean criterion in the form of tan functions. A tri- 
angle with angles a, B, and y exists if and only if 


{ tan(a+B+y) =0, 
tan(a+B) ¢ [0;tan a] 


(without loss of generality, we suppose that & is acute). The next theorem is a trans- 
lation of this condition into the integer case. 

Second we give several preliminary definitions. 

Let n be an arbitrary positive integer, and let A = (x,y) be an arbitrary integer 
point. Denote by 7A the point (nx,ny). 


Definition 6.8. Consider an integer polygon or broken line with vertices Ao,...,Ax. 
The polygon or broken line nAop...nAx is called n-multiple (or multiple) to Ao,...,Ax. 


Let p; (for i= 1,...,k) be rational numbers and let [aj ;;a2; : --- : dy,] be their 
odd continued fractions. Define 


[Pay ++ Pe[= [a1,15@2,15° + Qn, 1281,2:02,22° Ang 220 10 gD A “dn, kl 
For instance, 
31 


Jpeg [= lstrtst:2:3.= 5. 
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Now we are ready to formulate the generalization of the Euclidean theorem on 
sum of angles in triangles. 


Theorem 6.9. On sums of integer tangents of angles in integer triangles. 
(i) Let (01,Q,03) be an ordered triple of angles. There exists a triangle with 
consecutive angles integer congruent to Q), O2, and O3 if and only if there exists 
i€ {1,2,3} such that the angles & = Oj, B = O;+1(mod3)» Y = %i+2(mod3) Satisfy the 
following conditions: 
(a) for & =] |tan a, —1,Itan B|, the following holds: € <0or & > Itana or € =~; 
(b) ]ltana, —1,ltanB, —1,ltan y[= 0. 
(ii) Let a, B, and y be the consecutive angles of some integer triangle. Then this tri- 
angle is multiple to the triangle with vertices Ag = (0,0), Bo = (Aglcos @, Az |sin @), 
and Co = (41,0), where 


_ Isin B 
~ gcd(Isin a, sin B , lsin 7) 


Isiny 


A 7 gcd(Isin a, lsin B,Isin y) ° 


and Ag 


We are not yet ready to prove the first statement of this theorem; we shall do 

it later, in Section 16. We prove the second statement of the theorem after a small 
remark. 
Remark 6.10. The statement of Theorem 6.9(i) does not necessarily hold for even 
continued fractions of the tangents. For instance, consider an integer triangle with 
integer area equaling 7 and all angles integer congruent to larctan7/3. For the odd 
continued fractions 7/3 = [2;2: 1] of all angles we have 


[2;2:1:—-1:2:2:1:-1:2:2:1])=0. 


If instead we take the even continued fractions 7/3 = [2;3], then we have 
35 
233: —1:2:3:-1:2:3)= 7, £0. 


Remark 6.11. In Chapter 17 we discuss the relation on integer angles of integer 
polygons similar to the case of triangles. 


Example 6.12. Let us consider the triangle with vertices A = (0,0), B = (1,3), and 
C = (3,0). Set @ = ZCAB, B = ZABC, and y = ZBCA: 


Then we have: 
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Itana = 3 = [3]; 
ItanB = 9/7 = [1;3 : 2]; 
Itany = 3/2 = [131: 1]. 


Let us check the expressions for both conditions of Theorem 6.9 for the triple 
(a, B, 7). We have 


(a)|ltan a, —1,ltan B[= [3;-1: 1:3: 2] =—3/2 ¢ [0,3] = (0, ltan a]; 
(b)| Itana@, —1,ltanB,—1,ltan y[= [3;-1:1:3:2:—-1:1:1:1}=0. 


In our case both conditions are fulfilled. 


Proof of the second statement of Theorem 6.9. Consider a triangle AABC with ra- 
tional angles a, B, and y (at vertices at A, B, and C respectively). 


Suppose that S is the integer area of AABC. Then by the definition of integer 
sine, the following holds: 


10(AB) 1€(AC) = S/Isina, 
10(BC) 1€(BA) = S/Isin B, 
1€(CA)1¢(CB) = S/Isiny. 


Let 1¢(AC) = alsin B, then from the above system of equations we have 


\si 
: jee -alsinB = alsin y. 


as B-10(BC)  IsinB 


Similarly, 
1€(BC) = alsina. 


By the first criterion of integer congruence, the triangle AABC is integer congruent 
to AA,B,C), where 


A, =(0,0) B, = (alsinylcosa,alsinylsina), C, = (alsinB,0). 
It is also clear that the condition for the triangle AA;B,C; to be integer is as follows: 


k 
= gcd(lsin a, lsin B, lsiny) 


where k should be a positive integer. Therefore, the triangle AA; B,C, is multiple to 
the triangle AAgBoCo of the theorem. 


6.4 Angles and segments of integer triangles 


Let us find the integer tangents of all angles and the integer lengths of all edges 
of any integer triangle, knowing the integer lengths of two edges and the integer 
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tangent of the angle between them. Suppose that we know the integer lengths of the 
edges AB, AC and the integer tangent of the angle 7BAC in the triangle AABC. Let 
us show how to restore the integer length and the integer tangents for the remaining 
edge and the rational angles of the triangle. 

For simplicity we fix some integer basis and use the system of coordinates OXY 
corresponding to this basis (denoted by (x, *)). 


Theorem 6.13. Consider some triangle AABC. Let 
1¢(AB) =c, 1€(AC)=b, and ZCAB=a. 
Then the angles ZBCA and ZABC are defined in the following way: 


a — larctan (=o) if cleosa > b, 


ZBCA = ¢ larctan(1) if cleosa = b, 
larctan’ Canc if cleosa < b, 
7m —larctan’ ae | if blcos(a’) > c, 
ZABC = ¢ larctan(1) if blcos(a’) =c, 


larctan (2S) if blcos(a@’) <c. 


Proof. We start with proving the formula for the angle 7BCA. Let a = larctan(p/q), 
where gcd(p,q) = 1. Then ACAB ~ ADOE, where D = (b,0), O = (0,0), and 
E = (qc,pc). Let us express the angle ZDEO. Denote by Q the point (qc,0). If 
qc—b = 0, then ZBCA = ZDEO = larctan 1. If gc—b # 0, then we have 


Isi 
ZQDE = larctan (4) = larctan (<=) : 


cq clecosa—b 


The expression for the angle 7BCA follows directly from the above expression for 
ZQDE, since ZBCA =~ ZQDE. (See Fig. 6.4; here 1¢(OD) = b, 1€(OQ) = clcos a, 
and therefore 1¢(DQ) = |clcos a — b].) 

To obtain an expression for the angle ZABC, we consider the triangle ABAC. 
Calculate the angle ZCBA and then transpose all angles in the expression. Finally, 
the integer length of CB is defined from the integer sine formula. 


6.5 Examples of integer triangles 


Let us define certain types of triangles occurring in integer geometry. Since dual 
angles are not necessarily congruent, we have more different types than in the Eu- 
clidean case. 


Definition 6.14. An integer triangle AACB is said to be dual to the triangle AABC. 
An integer triangle is said to be self-dual if it is integer congruent to the dual trian- 
gle. 
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c:lcosa > b c:lcosa = b c:-leosa < b 


Fig. 6.4 Three possible configuration of points O, D, and Q. 


An integer triangle is said to be pseudo-isosceles if it has at least two integer con- 
gruent angles. 

An integer triangle is said to be integer isosceles if it is pseudo-isosceles and self- 
dual. 

An integer triangle is said to be pseudo-regular if all its angles and all its edges are 
integer congruent. 

An integer triangle is said to be integer regular if it is pseudo-regular and self-dual. 


By the first criterion of integer congruence for integer triangles, the number of 
integer congruence classes for integer triangles with bounded integer area is always 
finite. In Fig. 6.5 we show the complete list of 33 triangles representing all integer 
congruence classes of integer triangles with integer areas not greater than 10. We 
enumerate the vertices of the triangle clockwise. Near each vertex of a triangle we 
write the tangent of the corresponding rational angle. Inside any triangle we write its 
area. We draw dual triangles on the same light gray region (if they are not self-dual). 
Integer regular triangles are colored in dark gray, integer isosceles but not integer 
regular triangles are white, and the others are light gray. 


Integer triangles of small area. The above criteria allow us to enumerate all 
integer triangles of small integer area up to integer equivalence. In the following 
table we write down the numbers N(d) of noncongruent integer triangles of integer 
area d for d < 20 (here the dual noncongruent triangles are counted as two). 


d_ \|1/2/3)4|5)6)7|8|9| 10) 11) 12) 13) 14}15}16/17}18}19}20 
N(d)}}1|1]2}3|2)4]4|5)/5] 6 | 4 |10) 6 | 8 | 8 ]11] 6 |13] 8 14 


Let us prove an easy statement on the statistics of N(d). 


Proposition 6.15. We have 
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9/2 


1 3 3/2 3 
9/5 e e e 978 e e e e e 10, e e e * 9 CJ e e °108/9 
; . e e e ! ! e e e e e 
5 
5/2 


5/3 


3 5/ 


Fig. 6.5 List of integer triangles of integer area less than or equal to 10. 
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Proof. First, let us show that N(d) < d(d+1)/2. From Theorem 6.9(ii) we know 
that every integer triangle is equivalent to some AAgBoCp where 


Ao = (0,0), Bo =(Azg,A2p), Co=(A1,0), 


where 0 < q < p. The area of the triangle is exactly AjA,p. Hence Azp < d. There 
are exactly d(d +1) /2 integer points satisfying all listed conditions. Each such point 
can be chosen to construct Bo. 

If d is divisible by Azq, we have an integer triangle with Co constructed uniquely 


by setting 
Co = ( ’ 0) 
0 Aoq’ : 
Hence there are at most d(d + 1)/2 triangles of area d. 


Let us show that N(d) > d/3. There are exactly d noncongruent angles that ap- 
pear in triangles of area d: 


laretan ©, k=1,2,...,d. 


Each triangle contains at most three noncongruent angles. Hence N(d) > d/3. 


It appears that the growth rate is linear. For instance, for a prime number d, we 
always have N(d) < d. Still, in some exceptional cases, when d has many divisors, 
it can happen that N(d) > d. For instance, N(240) = 248. 


6.6 Exercises 


Exercise 6.1. Find all triangles of area 11. 


Exercise 6.2. Let ltan(ABC) = 7/5; 1¢(AB) = 3; 1¢(BC) = 5. Find the remaining 
angles and the remaining edge. 


Exercise 6.3. Prove that in Euclidean geometry there exists a triangle with pre- 
scribed angles a, B, and y if and only if 


{ tan(a+B+y) =0, 
tan(a+B) ¢ [0;tan @] 


(here without loss of generality we suppose that a is acute). 


Exercise 6.4. Is there a triangle with angles congruent to 
larctan(24/7), larctan(24/13), and larctan4? 


Exercise 6.5. Suppose g has n divisors. Find an upper estimate on N(q) linear in d 
and n. 


| ® 
Check for 
updates 


Chapter 7 


Minima of Quadratic Forms, the Markov 
Spectrum and the Markov-Davenport 
Characteristics 


The theory of Markov numbers studies minima of the absolute values of the 
quadratic decomposable forms in the plane over the lattice of integer points (ex- 
cluding the origin). In this chapter we briefly discuss this classical subject, focusing 
on the discrete Markov spectrum that has the most relevant connection to geome- 
try of continued fractions. We conclude this chapter with the notion of Markov— 
Davenport characteristic that we use later in the study of the conjugacy classes for 
matrices. 


7.1 Calculation of minima of quadratic forms 


Consider a quadratic form 
f(x,y) = ax? +bxy+cy" 
with real coefficients and positive discriminant A(f) = b* — 4ac. Set 


m(f)= inf F(x, y)|. 
) exspettrcoay M 


The set of all possible values of ,\/A(f)/m(f) is called the Markov spectrum. For a 
general reference on the subject we recommend the book of T.W. Cusick [47]. 


Proposition 7.1. (i) Let f be a quadratic form with positive discriminant. Consider 
the geometric continued fraction defined by two lines, all of whose points satisfy the 
equation 


f(x,y) = 0. 


Let S be the set of all vertices in all four sails of this continued fraction. Then we 
have 


m(f)= inf |f(,y)I- 


(xy)ES 
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(ii) Let the LLS sequence of the continued fraction be |...,a—1,a0,4,,...]. Suppose 
that v; = (xi,yi) is a vertex of the sail corresponding to the element a; of the sail 
(i.e., the integer sine at vertex v; in the sail equals a;). Then 


VA(f)/m(f) = sup (ai + [0sa41 Pago tie | + Oat aaa: ---]). 


The first statement of Proposition 7.1 is a straightforward corollary of the con- 
vexity of sails. For further details regarding the second statement of Proposition 7.1 
we refer to the original papers of A. Markov [146] and [147] (see also [47]). 


Remark 7.2. If the LLS sequence is periodic, the expression of Proposition 7.1 (ii) 


VAD Im(A) = (aot eet) ae 


ltana Itana‘ Isin? o 
where the angle @ is integer congruent to larctan({a1;--- : d2n—1]), and where 
(a0,41,---,@2n—1) is some minimal even period of the LLS sequence. 


If f has integer coefficients, the corresponding continued fraction is either finite 
or periodic. The minimum is then attained at some vertex of the sail of the geometric 
continued fraction. So the calculation of this minimum is similar to the calculation 
of the complexity of minimal periods studied above in this chapter. 


Remark 7.3. Instead of a quadratic form in two variables one can take a form of 
degree n in n variables corresponding to the product of n linear real forms. Any 
such form defines a point in the multidimensional Markov spectrum. Namely, for 
a form f one takes the infimum of the set of absolute values of f for all nonzero 
integer points divided by the n-th root of the discriminant of f. There is not much 
known about the multidimensional Markov spectrum. We refer the interested reader 
to [55], [56], [57], and [209] (see also in [79]). 


7.2 Some properties of Markov spectrum 


The Markov spectrum is a closed set that has a complicated structure. It does not 
contain points less than \/5. On the segment [\/5,3] the spectrum has a unique limit 
point, which is 3. It was described by A. Markov (see in [146] and [147]). The first 
elements in the spectrum in the increasing order are as follows: 
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V5=1+4+(0: (1)] 
8 0: 


V8 =1+(0: (2)]+(0: 2)) 
ve = 24 0;(2:1:1:2))+[0;(1:1:2:2)], 
1517 


=2+4+(0;(2:1:1:1:1:2))+[0;(1:1:1:1:2:2)], 


=24(0;(2:2:2:1:1:2))+[0;(1:1:2:2:2:2)], 


=24+(0;(2:1:1:1:1:1:1:2))+[0;(:1:1:1:1:1:2:2)). 


All these points are enumerated in Theorem 7.4. All real numbers greater than the 
so-called Freiman’s constant 


__ 221564096 + 283748462 


= 4.52782 Lis 
491993569 eel oa ee 


are in the Markov spectrum. The segment [3,F'] has not been completely studied. 
It has many gaps, i.e., open segments that belong to the complement to the Markov 
spectrum. The largest gap in the Markov spectrum below 3 is the open segment 
(12, 13). The first gaps, including the segment (12, 13), were calculated by 
O. Perron in [179] and [180]. Nowadays many other gaps are known, but a complete 
description of the gaps is unknown. For further information we refer the interested 
reader to [44] and [47] (see also [91]). 


It is also known that in some neighborhood of 3 the Markov spectrum has 
Lebesgue measure zero (for instance, in the paper [35] by R.T. Bumby, it is shown 
that the spectrum is of measure zero for the the ray x < 3.33440). 


The value 3 of the spectrum is interesting on its own. For example, as stated 
in [47] for any real number x and positive integer n we have that 


2+ (0; (1: 1)" :%]-+(0;2: (1:1)?! : 2] =3. 


Hence, 3 is in fact represented by infinitely many forms. 


Finally for the discrete Markov Spectrum we have the following remarkable the- 
orem. 


Theorem 7.4. (A. Markov [147]) (i) The Markov spectrum below 3 consists of the 
numbers \/9m? — 4/m, where m is a positive integer such that 


me +m +m, = 3mmm, mz <m, <m, 


for some positive integers m, and m2. 
(ii) Let the triple (m,m,,mz) fulfill the conditions of item (i). Suppose that u is the 
least positive residue satisfying 
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mau = iM, modm 


and v is defined from 


Then the form 


fal.) = mx? + (3m—2u)xy + (v—3u)y? 


represents the value \/9m2 —4/m in the Markov spectrum. The form representing 
this value is unique up to the GL(2,Z) group action and the multiplication by a 
non-zero constant. 


Remark 7.5. It is interesting to observe that the theory of the Markov spectrum is 
based on the study of continued fractions with elements 1, 2, and 3. This is due to 
the fact that LLS sequences containing greater elements correspond to the quadratic 
forms that contribute to the Markov spectrum above Freiman’s constant. 


7.3 Markov numbers 


Finally, let us say a few words about the integer solutions of the equation mentioned 
in Theorem 7.4. The equation 


P+ te = 3xyz 


is called the Markov Diophantine equation. A Markov number is a positive integer x 
for which there exist positive integers y and z such that the triple (x,y,z) is a solution 
of the Markov Diophantine equation. The first few Markov numbers are 


1,2,5, 13,29, 34,89, 169, 194,233,433, 610,985, 1325. 
The corresponding solutions are Markov triples 
(1,1,1),(1,1,2), (1,2,5), (1,5, 13), (2,5,29), (1, 13, 34), (1,34, 89), (2,29, 169),... 


It turns out that Markov triples possess the following regularity. 


Proposition 7.6. Every Markov triple is obtained from (1,1,1) by applying a se- 
quence operations of the following two types: 

— permute the numbers x, y, and z in the triple (x,y,z); 

— if (x,y,z) is a Markov triple, then (x,y,3xy — z) is also a Markov triple. 


Without loss of generality we consider only solutions with x < y < z. The struc- 
ture of such solutions forms a tree, whose vertices are almost all of valence 3 (except 
for two vertices). An edge between two of vertices corresponds to the operation de- 
scribed in Proposition 7.6. We show several of the first vertices of the tree in Fig. 7.1. 
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(11) 


aes 


(ly 2,5) 


(1,5, 13) 


(2,5, 29) 


(1, 18,34) << 
(5, 18,194) 
(5, 29, 433) << 
(2, 29, 169) oF 


(1, 
( 
( 
( 
( 
( 
( 
(2, 


1, 34, 89) 
13, 34, 1325) 
13, 194, 7561) 
5, 194, 2897) 
5, 433, 6466) 
29, 433, 37666) 
29, 169, 14701) 
169, 985) 


Fig. 7.1 The Markov number tree. 
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Finally we would like to mention one very interesting old conjecture related to 
number theoretical properties of triples. 


Conjecture 2. Any Markov number occurs only once as the greatest element of a 
Markov triple. 


This conjecture was introduced by G. Frobenius in 1913 (see [65], pp. 458-487) 
and studied very intensively since that due to its relations to modular groups and 
other questions of modern number theory. We refer to the book [3] by M. Aigner for 
a comprehensive description of history and development of the conjecture. 


7.4 Markov—Davenport characteristic 


In this section we relate matrices with integer coefficients to quadratic forms with 
integer coefficients. 


Definition 7.7. The Markov—Davenport characteristic (or the MD-characteristic, 
for short) of any 2 x 2 matrix A is the functional 


A,:R?=R defined by Aa(v) = 2S(0,v,A(v)), 


where S(O,v,A(v)) is the nonoriented area of the triangle with vertices v, A(v), and 
the origin O. 


Remark 7.8. In the case when A has two real distinct eigenvalues, A, is propor- 
tional to the absolute value of the product of two linear forms annulating differ- 
ent eigenlines of A (see Proposition 7.12 below). Such forms were considered by 
A. Markov in relation to Markov minima, (see Remark 7.14). Similar forms in the 
three-dimensional case were considered by H. Davenport, see, e.g., [55, 56, 57]. 
Later in Subsection 25.4.1 we return to the study of the MD-characteristic in the 
multidimensional settings. 


Example 7.9. Let us consider an example of matrix 
13 
A= € : , 


A(v) = (x+ 3y,2x+4y). 


Consider v = (x,y), then 


Then 
Aa(x,y) = 


Xx x+3y = 2 2 
det (3) |= 2 + 3xy — 3y’|. 


As we see the MD-characteristic is a quadratic function on the plane. In general 
we have the following formula. 
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Aas 


Aa(x,y) = |bx? + (d —a)xy—cy”|. 


Proposition 7.10. Let 


Then 


Proof. We have 


Aa(x,y) = 


X ax cy oe 5} _ a ass 
det 6 a) = |bx“ + (d—a)xy—cy’|. 


One of the important properties of the MD-characteristic for a GL(2,Z) matrix 
A is its invariance under the action of A. Namely, we have the following statement. 


Proposition 7.11. We have 
Aa(v) = Aa(A(v)). 


Proof. The triangle with vertices O, v,A(v) is integer congruent to the triangle with 
vertices O = A(O),A(v),A?(v) =A(A(v)). Therefore, 


Aa(v) = Ag(A(v)). 


The invariance of the MD-characteristic has the following geometric explanation 
in terms of eigenlines. 


Proposition 7.12. Assume that A has two distinct eigenvalues. Let l, and ly be two 
nonzero linear forms annulating eigenspaces of these eigenvectors (they might be 
complex here). Then there exists a non-zero constant & such that 


Ag = |(1) -b2)|. 


Proof. The statement is straightforward in the coordinates of the eigenbasis. 


Remark 7.13. In particular from Proposition 7.12 one may conclude the following. 
If a matrix A has two distinct real eigenvalues then the level sets of A, are hyperbolae 
whose asymptotes are eigenspaces. If a matrix A has two distinct complex (non-real) 
eigenvalues then the level sets of A, are ellipses. (See Fig. 7.2.) 


Remark 7.14. (Markov minima for matrices.) Consider an integer matrix with dis- 
tinct real irrational eigenvalues. Definition 7.7 suggests that the elements of Markov 
spectrum can be associated to matrices via minima of the absolute values of their 
MD-characteristic on the integer lattice except the origin. Note that such minima 
are attained at vertices of one of the sails of four angles centered at the origin and 
with edges on inct eigenlines of the corresponding matrix. (For further information, 
see [110, 111].) 
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Fig. 7.2 Level sets of A, in elliptic (Left) and hyperbolic (Right) cases. 


7.5 Exercises 


Exercise 7.1. Find Markov minima of the following forms 
(a)x*+xy—y?;  (b) 2x*—4xy+y?;  (c) 2x2 —7y”. 


Exercise 7.2. Find matrices whose MD-characteristics represent the first five ele- 
ments in the Markov spectrum. 


Check for 
updates | 


Chapter 8 
Geometric Continued Fractions 


In this chapter we set a more general definition of geometric continued fractions, 
which is related to the arrangements of pairs of distinct lines passing through the ori- 
gin (see section 8.1 for basic definitions). Further in Subsection 8.2 we relate these 
continued fractions to GL(2,R) matrices. In Section 8.3 we show that all four LLS 
sequences in the geometric continued fraction are the same for the case of irrational 
slopes of the lines. This property implies that the LLS sequence is well-defined with 
hyperbolic matrices (see in Section 8.4). We discuss periodic properties of algebraic 
sails related to GL(2,Z) operators in Section 8.5. This chapter is concluded in Sec- 
tion 8.6 with the a small discussion on how to compute LLS cycles of GL(2,Z) 
matrices. 


8.1 Definition of a geometric continued fraction 


In 1895, F. Klein introduced a generalization of the geometric interpretation of reg- 
ular continued fractions to the multidimensional case (see his original papers [119] 
and [120]). We will discuss the general multidimensional case in the second part 
of the book. Now we give a definition for the one-dimensional Klein geometric 
continued fraction. It is slightly different from the definition of regular continued 
fractions. 


Definition 8.1. Consider a pair of distinct lines /; and /2 passing through an integer 
point O. The lines /; and /y divide the plane into four cones. A geometric continued 
fraction is the union of the sails of these angles corresponding to these cones. 


Notice that the LLS sequences of the angles between the lines are lattice invari- 
ants of a geometric continued fraction. Since the LLS sequence of an angle is its 
integer complete invariant (see Theorem 4.11), we have the following corollary. 


Corollary 8.2. The LLS sequence is a complete invariant of lattice angles under 
the group of lattice affine transformations Aff(2,Z). 
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8.2 Geometric continued fractions of hyperbolic GL(2, R) 
matrices 


Recall the following general definition. 


Definition 8.3. A GL(2,R) matrix is called hyperbolic/parabolic/elliptic if its 
eigenvalues are distinct and real/coincide/distinct and complex conjugate. 


Let us associate continued fractions to hyperbolic matrices. 


Definition 8.4. Consider a hyperbolic matrix A in GL(2,R) with two distinct eigen- 
lines. The geometric continued fraction defined by these two straight lines is said to 
be associated to A. 


In Fig. 8.1 we show the geometric continued fraction for the matrix 


7 18 
9:13) ° 
Integer lengths of edges are denoted by black digits, and integer sines, by white. The 


LLS sequences of all four sails are equivalent to or inverse to 


Cu Oda), 


Fig. 8.1 Geometric continued fraction of the matrix @ a) ‘ 


8.3 Duality of sails 


As we have already seen in Fig. 8.1, the continued fractions of adjacent angles are 
equivalent up to a reversal of order. This happens not by chance, such situation is 
general. Before showing this, we discuss the notion of sail duality. 
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Definition 8.5. Two sails are dual with respect to each other if the LLS sequence 
of one sail coincides (up to an index shift if the LLS sequences are infinite on both 
sides) with the inverted LLS sequence of the second sail such that the subsequences 
of lengths and angles of the first sail coincide respectively with the subsequences of 
angles and sails of the second one. 


Proposition 8.6. Let A be a hyperbolic matrix with no integer eigenvectors. Then 
the sails of the opposite octants are congruent. The sails of the adjacent octants are 
dual. 


Proof. The sails of opposite octants are congruent, since they are taken one to an- 
other by the symmetry about the origin. 


Let us prove the duality. Let one of the sails for the geometric continued fraction 
be a broken line (A;). Without loss of generality we may fix coordinates such that 
Ao = (1,0) and A; = (1,ao). Set Bo = (0,1). Notice that Bo is on the dual sail 
(since the sail is defined by the lines y = ax and y = Bx, where ag < @ < aj+1 and 
—1< B <(0in the chosen coordinates). Denote the remaining points of the dual sail 
by Bj, starting from Bo. 

Let (a;) and (b;) be the LLS sequences for the sails (A;) and (B;). In Section 3 we 
have already shown the edge—angle duality in the orthant of points with positive 
coordinates. In other words, we know that a; = b_;_; fori > 1. 

Let B’ be the point symmetric to B; with respect to the origin (i € Z). Recall that 


B_; =(a),a9a; +1) and Az = (aja2+1,apa1a2 +a9 +42). 


Consider the linear transformation taking A2 to (1,0) and A, to (1,a2), namely 


aga; +1 —a\ 
aoaja2 +ay + a2 —ajan—1)° 
This transformation takes the point B’_, to the point (0,1). Now we have edge— 


angle duality of A2A;Ao... and B’!_,Bp... . Therefore, a; = b_j-; for i < 1. Hence 
the LLS sequences are inverse to each other. 


Example 8.7. Let us consider the example of the matrix 


718 
(; 4 
mentioned before (See Fig. 8.1). As we can see the LLS sequences for all four sails 
coincide and are periodic with period (1,1,3,2). Note the following small difference 
between the sails and their dual sails: The sail containing the point (1,0) and its 
opposite sail have integer sines with period (1,3) and integer lengths with period 


(1,2). However, two dual sails both have integer sines with period (1,2) and integer 
lengths with period (1,3). 
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8.4 LLS sequences for hyperbolic matrices 


Let us define LLS sequences for hyperbolic matrices. 


Definition 8.8. The LLS sequence of a matrix A in GL(2,R) is the LLS sequence 
for any of its sails up to the choice of a direction of a sequence and zero element. 


In the next proposition we list some basic properties of LLS sequences for hy- 
perbolic matrices. 


Proposition 8.9. (i) For any sequence of integers infinite in two sides there exists 
a hyperbolic matrix whose LLS sequence coincides with the given sequence. 

(ii) Let hyperbolic matrices A and B have the same LLS sequence. Then there 
exists a matrix C commuting with A and integer conjugate to B. 


8.5 Algebraic sails and their LLS cycles 


8.5.1 Algebraic sails 


Consider now the case of hyperbolic matrices in GL(2, Z). It turns out that all such 
matrices have characteristic polynomials irreducible over the rational numbers. The 
sails of such matrices are called algebraic. 


Proposition 8.10. Every algebraic sail has a periodic LLS sequence. 


Remark 8.11. Later, in Corollary 11.10, we show that a sail with periodic LLS 
sequence is algebraic. 


Proof. Let A be hyperbolic matrix in SL(2,Z). The matrix A preserves its invariant 
lines and the lattice Z?. Hence it acts on each of the sails by shifting the vertices of 
the corresponding broken line along the broken line. Therefore, the LLS sequences 
of algebraic sails are periodic. 


In Fig. 8.1, we show the sails of a hyperbolic algebraic matrix with the period of 
the LLS sequence equal to (2,1, 1,3). 


8.5.2 LLS periods and LLS cycles of GL(2,Z) matrices 


We start with the hyperbolic matrices with positive eigenvalues (here we require that 
these eigenvalues are irrational). Such matrices act on each of their sails as a shift, 
so we can give the following definition. 
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Definition 8.12. Let M be an SL(2,Z) matrix with positive irrational eigenvalues. 
Then M acts on the sail for M as a shift. Hence M also defines a shift of the LLS 
sequence. The factor of the LLS sequence with respect to this shift is called the LLS 
cycle of M. 

Any sequence of the corresponding LLS cycle (i.e., we mark a starting element of 
the LLS-cycle) is called an LLS period of M. 


So the LLS cycle is a cyclically ordered sequence of an even number of integer 
elements. Notice that (1,2,1,2) and (2,1,2,1) are distinct LLS periods represent- 
ing the same LLS cycle. Notice further that the LLS periods (1,2,1,2) and (1,2) 
represent distinct LLS sycles. 


If one or both of the eigenvalues of M are negative, then we still can define the 
LLS cycles and LLS periods using the fact that the eigenvalues of M? are positive. 


Definition 8.13. Let M be a hyperbolic GL(2,Z) matrix. Then M* has the LLS 
cycle, which is a twice repeated cyclic sequence. We say that this cyclic sequence 
(once repeated) is the LLS cycle of M. 

Any (non-cyclic) sequence of the corresponding LLS cycle is called an LLS period 
of M. 


Remark 8.14. Note that inverse matrices to each other have reversed periods. It is 
interesting to note that for palindromic sequences the matrices are conjugate to their 
inverses. 


For matrices with positive eigenvalues we have the following proposition. 


Proposition 8.15. Two hyperbolic SL(2,Z) matrices M, and Mp with positive irra- 
tional eigenvalues are integer conjugate if and only if their LLS cycles coincide. 


Proof. Let M; and Mp) be integer conjugate. Then they have integer congruent ge- 
ometric continued fractions, and they define the same shift of these continued frac- 
tions. Therefore the LLS cycles of M; and Mp) coincide. 

Suppose now that the LLS cycles of the matrices M; and M2 coincide. This means 
that the corresponding continued fractions have the same LLS sequences, and there- 
fore they are integer congruent. Therefore, there exists a matrix M4 integer conjugate 
to M2 whose geometric continued fraction coincides with the geometric continued 
fraction of M;. Suppose v is a vertex of the geometric continued fraction of the 
operator M,. Then 


Mi(v) =M3(v) and M,(Mj(v)) = M3(Mj(v)), 


since the matrices M; and M4 define the same shift of the geometric continued frac- 
tion. Since the vectors v and M;(v) form a basis of R*, we have 


M,=M3. 


Therefore, M, is integer conjugate to Mo. 


In the case of arbitrary GL(2,Z) matrices we have a weaker statement. 
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Corollary 8.16. Consider two hyperbolic SL(2,Z) matrices M, and Mp with irra- 
tional eigenvalues. Then the LLS cycles M, and M2 coincide if and only if M, is 
integer conjugate either to Mz or to —M). 


Remark 8.17. This statement is rather surprising as the matrix equation 
X°=M 


has four GL(2,R) solutions in the case where the eigenvalues of M are positive and 
distinct. 


Proof. It is clear that if M, is integer conjugate either to M2 or to —M> then LLS 
cycles coincide. 


Now let LLS cycles coincide. Then M is integer conjugate to M3. Let us consider 
two cases. 

In the case where LLS cycles of M, and M2 have even number of elements, then 
M does not swap lengths and sines in the LLS sequence, hence it maps the cones 
either to themselves or to the opposite ones. In both cases we have detM, -detM2 = 
1, so the eigenvalues are either simultaneously positive or negative for both M, and 
Mp). Therefore, Mj is integer conjugate either to Mz or to —Mp. 

In the case where LLS cycles of M, and M> have an odd number of elements, 
then both M, and M) swap lengths and sines in the LLS sequence, hence it maps the 
cones to the adjacent ones. In both cases we have detM, = detM = —1, so one of 
the eigenvalues is positive and one is negative for both M; and M2. Therefore, M, is 
integer conjugate either to M2 or to —M). 

In both cases M is integer conjugate either to M2 or to —M). This concludes the 
proof. 


We conclude this subsection with a statement on the form of LLS sequences for 
a given matrix. 
Proposition 8.18. Let M be a GL(2,Z) hyperbolic matrix with irrational eigenval- 
ues and let 

(a1,---,4n) 
be its LLS cycle. Let also m be the minimal lengths of the periods of the LLS se- 
quence for M. Then the list of all LLS periods for M consists of the following m 
sequences 
(Qk415---;Akin) fork=1,...,m 


(here the indices are taken modulo n). 


Proof. Operator M? acts on all sails as shifts by n vertices. Hence we get all of the 
periods are obtained by applying M to vertices of both sails. 


Remark 8.19. Note that all periods starting at elements of the LLS sequence with 
odd indexing correspond to the same sail. All periods starting at elements with even 
indexing correspond to the sails of the adjacent angles. 
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8.6 Computing LLS cycles of GL(2,Z) matrices 


In this section we address a question of computing LLS sequences for GL(2,Z) 
matrices with irrational real eigenvalues (such eigenvalues will be automatically 
distinct). 


8.6.1 Differences of sequences 


Let us start with the following supplementary definition. 


Definition 8.20. Let m > n be non-negative integers. Consider two number se- 
quences 
Sa = (a1,---,4m) and Sp = (b1,...,Dn). 


We say that there exists a difference of the sequences S, and Sy, if there exists k < 
m-+ 1 satisfying the following conditions 

(a) bj =a; for 1 <i<k; 

(b) either k =m-+1 or by 4 ag; 

(C) beri = k+it+m—n forO Sicn—k. 
In this case we write 


Sa — Sp = (Gk, 441, +++, @k+n—m—1)- 
Example 8.21. We start with the following example: 
(1,2,3,4,5,6,7,8) — (1,2,3,6,7,8) = (4,5). 
Example 8.22. The expression 
(1,2,3,4,5,6,7,8) — (1,4,8) 


does not exist. 


8.6.2 LLS cycles for SL(2,Z) matrices with positive eigenvalues 


In this subsection we show how to compute LLS cycles for SL(2,Z) matrices with 
positive irrational eigenvalues (see also in [112]). The computation is based on the 
following statement. 


Proposition 8.23. Let an SL(2,Z) matrix M have distinct irrational positive eigen- 
values. Let also P be any non-zero integer point. Then there exists a difference 


LLS(ZPO(M3(P))) —LLS(ZPO(M?(P))), 
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and the resulting sequence is an LLS period of M. 


Remark 8.24. It can happen that the obtained period of the LLS sequence are not of 
the minimal length. 


Example 8.25. Consider an SL(2,Z) matrix 
7 —37 
oe (2h 2) ; 


Note that this matrix has two eigenvalues 30 + 899 that are both positive irrational 
numbers. Hence we can apply Proposition 8.23. 
Let us take P = (1,1). Then we have 


M?(P) = (—1801,2579) and M3(P) = (—108030, 154697). 


Further we compute LLS sequences of the corresponding angles (e.g., using one of 
the algorithms of Section 4.5): 


LLS(ZPO(M?(P))) = (1,1,2,3,5,1,2,3,4); 
LLS(ZPO(M3(P))) = (1,1,2,3,5, 1,2,3,5,1,2,3,4). 


Hence 
LLS(ZPO(M?(P))) —LLS(ZPO(M?(P))) = (5, 1,2,3). 


Therefore, the LLS cycle of M is (5,1,2,3). 


Proof of Proposition 8.23. The complement to the union of eigenlines consists of 
four cones. Consider one of these cones containing the point P (denote it by C). 
Then the convex angle ZPoOP, is the fundamental domain of C up to the action of 
the group of (integer) powers of M. Hence there is at least one vertex of the sail for 
C in the interior of ZP9OP;. Denote one of the sail vertices inside ZP9OP,; by v. We 
immediately get that the angle ZPy)OP; contains vertices 


vo=v, vi=M(v), and v2 = M?(v). 


Thus by convexity reasons, the sail for the angle ZPoOP3 contains the part of the 
sail of C between the vertices vg and v2. 
Namely, the sail of ZPoOP3 consists of the following four parts: 


e $1: a part of the sail contained in PyOvo; 
e S$»: a part of the sail contained in vgOv}; 
e $3: a part of the sail contained in vj Ov2; 
e $4: a part of the sail contained in v2OP3. 


Since vj} = M(v) and v7 = M?(v) the parts Sz and $3 are periods of the sail for C. 


Similarly the vertices vo and v, are in the sail of 7Py)OP>. Therefore, the sail for 
ZPoOP) consists of the following three parts 


° Si a part of the sail contained in PoOvo; 
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° SS: a part of the sail contained in voOv); 
° S3: a part of the sail contained in vj OP); 


Let us observe that 


Therefore, the difference of the LLS sequences for the angles ZPp9OP3 and ZPoOP, 
is precisely a period of the LLS sequence for C. This period corresponds to the 
sequences between the points v; and v2. This period is an LLS period for M as 
M(v,) = v2. After considering this period up to a cyclic shift we have the LLS cycle 
for M. This concludes the proof. 


Remark 8.26. Note that it is not sufficient to consider the difference of the LLS se- 
quences for the angles 7P)OP and ZPoOP,. Here it is not possible to determine one 
of the integer sines of the periods. Let us illustrate this with the following example. 


Consider a matrix 
Spee 12 
~A\13 


P, =M(Py) = (2,1), P, =M?(Py) = (4,5), and P3; = M?(Po) = (14,19). 


and set P = (4,—1). Then 


The LLS sequences for the angles ZPy9OP,, ZP9OP) and ZPoOP; are respectively 


We have 
LLS (ZPOP3)) —LLS(ZPOP>3)) = (1,3,1,2,1,3,1) — (1,3,1,3,1) = (2,1) 
which is a correct period for the LLS sequence of M, while the difference 
LLS(ZPOP2)) —LLS(ZPOP;)) = (1,3,1,3,1) —(1,4,1) 


is not even defined. 


8.6.3 LLS cycles for GL(2,Z) matrices 


Let us extend the result of Proposition 8.23 to GL(2,Z) matrices with distinct (not 
necessarily positive) irrational eigenvalues. (See also in [112].) 
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Corollary 8.27. Let a GL(2,Z) matrix M have distinct irrational eigenvalues. Let 
also P be any integer point distinct from the origin. Then there exists a difference 


LLS(ZPO(M°(P))) —LLS(ZPO(M‘(P))), 
and the resulting sequence is an LLS period of M repeated twice. 
Proof. By Proposition 8.23 the difference 

LLS(ZPO(M°(P))) — LLS(ZPO(M‘(P))) 


exists and is a period for M’. Finally by the above the resulting sequence is a period 
of the LLS sequence for M repeated twice by Proposition 10.14. 


Remark 8.28. The proof of Corollary 8.27 is based on Proposition 8.23 which we 
prove after giving several important results of the classical Gauss Reduction theory 
are given. Still we prefer to discuss Corollary 8.27 in this section for convenience of 
the material exposition. 


The situation of Corollary 8.27 dramatically changes if we take an integer matrix 
that is not in GL(2,Z). Here we consider matrices with distinct irrational eigenval- 
ues. Such matrices still have periodic LLS sequences, however they do not deter- 
mine shifts of their sails. We have the following conjecture. 


Conjecture 3. For any integers m > n there exists a matrix M ¢ GL(2,Z) with 
distinct irrational eigenvalues such that, for every integer point P 


LLS(ZPO(M™(P))) —LLS(ZPO(M"(P))) 


does not contain a period for the sail of M. 


(Experiments suggest that 7M(P)OM?"(P) might not contain a single funda- 
mental domain of the corresponding invariant cone for M.) 


8.7 Exercises 


Exercise 8.1. Draw a geometric continued fraction defined by the lines x —2y = 1 
and 3x +4y = 3 and calculate the LLS sequences for all the sails. 


Exercise 8.2. Is the statement of Proposition 8.6 true for rational angles (R-angles, 
L-angles)? Find a correct analogue in these cases. 


Exercise 8.3. Prove Proposition 8.9. 


Exercise 8.4. Find the LLS cycles for the following matrices 


(a) 6 a (b) (4), (c) € aT 


Check for 
updates | 


Chapter 9 
Continuant Representation of GL(2, Z) Matrices 


Let us now discuss two remarkable representations of SL(2,Z) and GL(2,Z) re- 
spectively. We start with a classical representation of SL(2,Z) in terms of the mod- 
ular group (see Section 9.1). It turns out that SL(2,Z) can be generated by two ele- 
ments. Further we discuss an alternative approach to represent elements of GL(2, Z) 
(which includes SL(2,Z)) that is based on continuant representation of matrices. 
In Section 9.2 we introduce supplementary material for that approach: we set el- 
ementary matrices and write their products in terms of continuants. After that in 
Section 9.3 we provide matrix continuant representations mentioned above. In fact 
the representation exists for almost all GL(2,Z) matrices (see Theorem 9.6). Finally 
in Section 9.4 we show how to express matrices of SL(2,Z) via classic generators 
discussed in Section 9.1 using continuants. 


9.1 Generators of SL(2,Z) and the modular group 


The group SL(2, Z) is very well studied due to its relations to the modular group in 
hyperbolic geometry. Let us just say a few words about it. Recall that the modular 
group is the group of linear fractional transformations with integer coefficients of 
the upper half of the complex plane. Such transformations can be written in the form 


az+d 
Zh 


cz+d 


(see e.g. in [4]) and represented by two matrices + (< ‘) . Denote this transforma- 
‘ ab 
tion by é ‘l , 


Here the matrix can be taken up to the projective invariance, i.e., up to the mul- 
tiplication by —1. By that reason the modular group is denoted by PSL(2,Z) (see 
more about modular groups in [184]). One can say, that 
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PSL(2,Z) = SL(2,Z)/{+Id}. 


It is well known that PSL(2,Z) is generated by two elements (see, e.g., [199]): 
1 
S:zhHy—-- and T:zrH z+. 
Zz 


Moreover the modular group can be defined by generators and relations as follows: 
PSL(2,Z) = (S,T|S? =I, (ST)? =I), 


where / is an identical transformation of the upper half plain. 


In analogy to PSL(2,Z), the group SL(2,Z) can be generated by matrices repre- 
senting the transformations S and T, namely by 


0-1 11 
ms=({ 7) and m= (94): 


SL(2,Z) = (Ms,Mr|Mg = Id, (MsMr)° = Id), 


Here we have 


Remark 9.1. Our intention for this section is to study an alternative representation 
of GL(2,Z) matrices in terms of products of the matrices of type 


01 

la)’ 
Such approach is based on geometry of numbers. We discuss it further in this chap- 
ter. 


9.2 Basic properties of matrices M,, a, 


Let us set the following general notation. 


Definition 9.2. Let a be a real number, denote by M, the following matrix: 


01 
moO) 


Now let (aj,...,@,) be any sequence of real numbers, we set 


“(01 
May,....an om Il (; : 


k=1 


9.2 Basic properties of matrices Mg, a, 99 


We continue with the following remarkable result for matrices Mg, ....a,- 


Proposition 9.3. Let n > 0 and let (a1,...,a,) be any sequence of real numbers. 
Then we have 


M = Ky-2(a2,.--,An—2) Ky—1(a2,-..,4n) 
Ble NN Rint (Gty 245-5 Gn=1) Ka(Q1,42ys0.4Gn) 


In addition, we have 
detM = (—1)”. 


Example 9.4. Let us consider the following simple example of a GL(2, Z) matrix 
7 32 
wa & a) , 
The continuant form for M and the corresponding product matrix representation are 


as follows 
M- K(—3,—2) K3(—3, —2,5) 
~ \K3(3,-3, -2) K4(3,-3,-2,5) ]° 


Hence M is represented by the following even sequence: 
(3,-3,—2,5). 
Therefore, we have 
M = M3 3-25 = M3 -M_3-M_2- Ms. 


Note also that 
detM = (—1)*=1. 


Proof. The proof of the proposition is by induction in n. 


Base of induction. For n = | we have 


= (Gar) = Cra ice) 
For n = 2 we have 


_ fil aa _ Ko() Ki (az) 
May .ar = Ma, Ma, = (, ) _ ey Ko(a1,a2) ; 


Step of induction. We have 
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Ky—2(a1,.--;4n—2) Kn—1(a2,.--,4n) , 0 1 = 
Kn-1 \,.-.,@n—1) K,(a1,---,4n) 1 n+ 
Ky 


( 
(a2,...,@n) Kn—2(@2,..-,@n—2) +€n41Kn_1(@2,.--,4n) 
a 
( 


1,-++5n) Ky-1(a1,---;4n—1) + n41Ky(a1,---,n) 


Kn-1 2,..-,4n) Ky (@2,---,4n+1) 
Ri Gijei@n) Rie Gl inca) 


The last inequality is a direct corollary of Proposition 1.13. This concludes the 
proof for the induction step. 


Finally, since det M, = —1 we have 


detM = (—1)". 


9.3 Matrices of GL(2,Z) in terms of continuants 


Let us start this section with the following definition. 


Definition 9.5. Let n be a positive integer. Consider the following set of sequences: 
Tn ={(a,.--,€n)| a1 € Zs ay,...,Qn-1 € D4; an € Zh. 
Set also 


j eee —9 ym 
k=1 


We have the following statement. 


Theorem 9.6. For every matrix 


in GL(2,Z) excluding matrices 


0-1 
E 6 : ) k=1,2,3,... 


there exist a unique integer n > 0 and a unique sequence of numbers (a),...,dn) € 
I_,— such that the following holds. 
(i) Ifa £0 then 
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= = Ky-2(a@2,.--;4n—2) Kn—1(@2,---,4n) 
HS 6 (Mace = cane) (ge 


Here the sequence (a1,...,4n) satisfies 


b 
— = [aj3a2 3 +++: dn_1], 
a 
where we take the odd regular continued fraction for b/a when det(M) = 1, and the 


even regular continued fraction for b/a when det(M) = —1. 
Finally, the value of ayn is the solution of the following linear equation: 


Kn-1(d2,---;Qn—1,X) =C. 
(ii) For the case a = 0 we have: 


01 
M= (; 2 =M; M 


Finally the matrices 
_fo-1 
“AL k 


are the only matrices that (up to multiplication by —1) do not have the continuant 
representations satisfying the above conditions. 


lI 
Goa 
Lo 
an 
light 
Pe oe 
ees 

lI 

I 
Ss 


Remark 9.7. Note that SL(2,Z) matrices correspond to sequences with an even 
number of elements, while GL(2,Z) with negative determinant are represented by 
sequences with an odd number of elements. This follows directly from Proposi- 
tion 9.3. 
Remark 9.8. As one can see, we have many expressions for GL(2,Z) matrices de- 
fined up to a sign of a matrix. This is closely related to the modular group PSL(2, Z) 
discussed above. Similar to PSL(2,Z) one can define PGL(2, Z): here we take all 
the matrices of GL(2,Z) up to the same equivalence M ~ —M and denote them by 
[M). 

Theorem 9.6 has the following nice corollary on the structure of the group 
PGL(2,Z). 


Corollary 9.9. The mapping 


{£Ma,, 


is a bijection of 
PGL(2,Z) \ { ? Fi kk E 2} 


and the set of all sequences I_..—. 


Proof. This corollary follows directly from the formulae of Theorem 9.6. 
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Example 9.10. Let us start with several examples of matrices containing a zero 
element. On the left hand side we write a matrix M and on the right hand side we 
show the sequence (a1,...,@,) such that 


M = Ma,....ay 
We have the following cases 
1k 1k 
01 (0,k) 0-1 (—1,1,k—1) 
10 1 0 
01 0-1 . 
1 2 (k) ot ) Not available 
k 1\| (0,4,0) k —1\|(0,k—1,1,-1) 
10/|k>0 10 /\|k>1 


with the last irregular matrix in the last family for k = 1: 


Go) 


Example 9.11. Let us now discuss cases formed by different variations of vectors 
(3,4) and (2,3). Here we multiply one or two of the vectors by +1 and/or simulta- 
neously swap the order of their coordinates or their orders. 


— 


i) 


32 a) 

) (1,2, 1,0) i = 
=) ees) 

a (y3p$1) oF 7) a ee 


43 4 3 
¢ i) (0,1,2,1,0) & =) ~1,3,1,0) | 
4-3 4 -3 
6 3) (0,1,3,—1) & ) -1,4,-1) ia 


Proof of Theorem 9.6. Item (i). Here we assume that a 4 0. We start with the case of 
SL(2,Z) matrices (i.e., the integer matrices with unit determinant). Recall that b/a 
has a unique odd regular continued fraction: 


= [a15-++ 1 aay-1]. 
So we set 
@= Kon 1(@1,...,42n-1) and b=Koy_1(a1,...,@2n-1) 


keeping in mind a general expression of Remark 1.18: 
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Kon—1(a1,. Be ,42n-1) 
Kon—2(a2,..-,@2n-1) 


= [aq3-++ t a2n—1]. 


The uniqueness of the odd continued fraction implies the uniqueness of the elements 
(a1, ,@2,-1) defining the continuants for a and b. 

Let us show how to find a,. Here we note that once (a,b) is fixed then the integer 
distance from (c,d) to the line through (0,0) and (a,b) is 1, and in addition (a,b) 
and (c,d) should form a positively oriented basis of R* (since detM = 1). Such 
points (c,d) should be an integer point of the line 


eas it 2) tags Gas bat say 
n ’ 


Koy 34 (Gis. Qn1) Kon_2(@1,---,@2n-2) 


where all integer points are attained for integer values of the parameter ay. It is clear 
that once we know the integer point (c,d), the value of a, is uniquely defined (see 
Fig. 9.1, Left) as the solution of the equation: 


C\ _— ( Kon—2(a2,...,@2n-1) % Kon-3(d2,.--,@2n—2) 
— XxX 3 
d Kon-1(@1,..+,@2n—1) Kon-2(a1,..-,2n—2) 


which is equivalent to 
c\ _ ( Kan-1(a2,---,@2n-1,5) 
d Kon (a1,.--,@2n—1,*) 


In particular (since det(M) = 1) the last system is equivalent to the linear equation 
Kyn—1(2,..-,@2n—1,X) = €. 


This concludes the proof for SL(2,Z) matrices. 
The proof for GL(2,Z) matrices with negative determinants is analogous, it is 
left as an exercise. This concludes the proof of Item (i). 


Item (ii). The first statements of this item are straightforward. The matrix 


_fO-1 

“AL k 

does not have a continuant representation by the following reason. One gets a = 0 
if and only if 


Kn-2(a2,--.,4n—1) =0 


with non-negative values a2,...,d@,—1. This is only possible in the exceptional case 
when n = 1, i.e., for the sequences (a1). It remains to say that 


Mum (a)A(re) 
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. r) . . . 


a3=-1 


Fig. 9.1 Possible values for (c,d) with (a,b) = (2,3) are on the two lines lines: one line for the 
case of SL(2,Z) matrices (Left) and one line for the case of det(M) = —1 (Right). 


for any values of a and k. This concludes the proof. 


Let us illustrate the proof of Theorem 9.6(i) with the following example. 


Example 9.12. Let us find all possible integers c and d such that the matrix 
2 
3d 


First, let us find all SL(2, Z) matrices. In this case we consider the odd continued 
fraction for 3/2: 


is in GL(2,Z). 


3 
2 
Now, by Theorem 9.6 all SL(2,Z) matrices will be listed by the sequences 


= (1,1: 1]. 


(1, 1, 1,a4). 
Then, we have 
2 K3(1,1,a4) — (22a4+1 
3 Ka(1,1,1,a4)/ ~~ \3 3aa4+2)’ 
where a4 is any integer. For instance if a4 = —1,0,1, then the vectors (c,d) are 


respectively as follows 
Catal), (12); (3,5) 


(see Fig. 9.1, Left). 


9.4 An expression of matrices in terms of Ms and Mr 105 


Now for the case of negative determinant we have: 


-= (1; 2]. 


Now, by Theorem 9.6 all SL(2,Z) matrices will be listed by the sequences 


(1,2,a3). 
Then, we have 
2 K(2,a3) oa 22x+1 
3 K3(1,2,a3)) \33x+1)’ 
where a3 is any integer. For instance if az = —1,0,1, then the vectors (c,d) are 


respectively as follows 
(1,2), (1.4); (3,4) 


(see Fig. 9.1, Right). 


9.4 An expression of matrices in terms of Ms and Mr 


Finally we would like to mention that there is a simple way to write down an ex- 
pression for any SL(2,Z) matrix in terms of generators Ms and Mr (introduced in 
Section 9.1 above) based on the statement of Theorem 9.6. 


We start with expressions for the exceptional matrices 


O-1\ ax QL: = Bina 
é ¢) Mi and -(j 1 )=3 M;. 


By Theorem 9.6 any other SL(2,Z) matrix M can be represented as 
M=— May.,....ar or M= —Ma,, ; 


for some sequence of even number of integers as in Theorem 9.6. 
Now the formula for M follows from the following two observations: 


Map = M§«(Ms-Mz“)-(Ms-M?) and Ss? =—-1, 


and the fact that 
n 
May....,aon = [ [Maa 
i=l 


We have 
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May....,a%, = II (ui : (Ms -M,"") : (Ms -M?)) and 
n 
~My stay =] (MQ (Ms: Mp) «(Ms -M)). 


Remark 9.13. Note that the above observation provides the evidence to the fact that 
Ms and Mr generate SL(2,Z). 


9.5 Exercises 


Exercise 9.1. Find the representation in terms of the continuants of the following 
matrices 


(a) € ve (b) & 5) (c) e a) 


Exercise 9.2. Find a decomposition in Ms and Mr of the following matrices 
15 7 —13 —3-2 
OC) "Oe Oe.) 


Exercise 9.3. Find all possible integers c and d such that the matrix 
7c 
5d 


Exercise 9.4. Write the coefficients of the matrix Ma, 
follows: 
(a) (1); (b) (1,2,3,4,5); (ce) (—3,2,1,4,-2); (d) (0,2,4,2,0). 


Exercise 9.5. Finish the proof of Theorem 9.6(i), namely study the case of GL(2, Z) 
matrices with negative determinants. 


is in GL(2,Z). 


jenn» £00 (G],...,dn) being as 


| ® 
Check for 
updates 


Chapter 10 
Semigroup of Reduced Matrices 


There are several ways to construct reduced matrices, however as a rule they are 
closely related with each other. The reason for that might be the structure of the 
group. We should mention that the approach here is rather different to the classical 
approach for closed fields via Jordan blocks. 


In this chapter we choose reduced matrices in a rather natural way such that they 
form a semigroup with respect to the matrix multiplication. An extra benefit of this 
choice is as follows: there exists a simple description of such matrices, we discuss 
it in Section 10.1 (see Theorem 10.7). 


An important property of reduced matrices is that every integer conjugacy class 
of GL(2,Z) possesses at least one reduced matrix in it, we show this in Section 10.2. 
In general there might be more than one reduced matrix in a conjugacy class, but as 
we show in Chapter 11 the number of such reduced matrices is finite. 


10.1 Definition and basic properties of reduced matrices 


We start in Subsection 10.1.1 with the definition of a reduced matrix. Then we com- 
pute continuant representations of all reduced matrices in Subsection 10.1.2. Fur- 
ther In Subsection 10.1.3 we prove a simple necessary and sufficient condition for 
a matrix to be reduced. Finally in Subsection 10.1.4 we specify the LLS cycles for 
reduced matrices (via the elements of the matrix). 


10.1.1 Reduced matrices 


We start with the elementary building blocks for generating all reduced matrices. 


Definition 10.1. For a positive integer a we say that the matrix 
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is the elementary reduced matrix. 


Starting with elementary reduced matrices we generate the semigroup of reduced 
matrices. 


Definition 10.2. We say that a matrix is reduced if it is a product of elementary 
reduced matrices. 


Directly from the definition of reduced matrices we get the following remarkable 
property. 


Proposition 10.3. The set of all reduced matrices is a semigroup with respect to the 
matrix multiplication; it is freely generated by elementary reduced matrices. 


Proof. The fact that the set of all reduce matrices is a semigroup follows directly 
from the definition. It is freely generated by the next theorem (Theorem 10.5), where 
we show that the LLS cycle for Mg, |... a, 18 (a1,..-,@n). Therefore, different products 
of elementary reduced operators have different LLS cycles. Recall that LLS cycles 
are invariants of matrices, and hence the semigroup is free generated. 


Remark 10.4. (On a complete invariant of reduced matrices.) Proposition 10.3 
implies that the set of hyperbolic reduced matrices is in one-to-one correspondence 
with the set of finite sequences of positive integer elements. Namely, the map 


(Biseriyla) Mae cae 
is a bijection. Furthermore, as we show later in Theorem 10.5 the sequence 
(a1,.--,4n) 
is the LLS cycle for Mg, 


10.1.2 Continuant representations of reduced matrices 


In the proof of Theorem 10.7 we use the following important structural theorem. 


Theorem 10.5. Let n, a1,...,dn be positive integers. Consider the matrix 


Then the following two statements hold. 
(i) We have 
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We Kya (Gaye s5Qyt) Kyu (Gd) 000 5Gq) 
Kn—1(@1,42,---;4n—1) Kn(a1,€2,.-.,@n) , 


(ii) The LLS cycle of M is 
(a1,42,...,n). 


Example 10.6. For the case of the above matrix 


we have 


and the LLS cycle for M is (1,1,2,2). 


Proof. Item (i) follows directly from Proposition 9.3. 


Item (ii). Consider the sequence of integer points 
(xe,¥e) = M*(1,0), fork = 1,2,3,.. 
By Item (i) for every k the coordinates x, and y, are relatively prime and 


k 
YR = [(aysag : +++: an). 
Xk 
Therefore, by Theorem 3.1, all the points (x;,y,) are vertices of the principal parts 
of the sail the periodic continued fraction 


O = [(aj3a2 2 +++: dy)). 


Therefore, the direction of the vector (1, a) is the limiting direction for the sequence 
of directions for the vectors (x;,,;), and thus we have 

lim ~* =a. 

ko Xf 
Hence (1,@) is one of the eigenvectors corresponding to the maximal eigenvalue. 
By construction, the LLS sequence for @ is periodic with period 


(a1 ,d2,...,An). 


Finally by Proposition 4.13 the sail for @ starting from some element coincides 
with the sail for M. Therefore, since the sail for M is periodic, the sail for @ has the 
same period, i.e., 

(a1,d2,...,An). 


Furthermore, 
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2 
M (1,0) = May,...,an,a} sn (1,0) 
— (Ron Ga ia any tly (Ron Ai geectaqsaiy ns ya4-4) 
Hence M? acts as a shift by n vertices on the sails associated to M. Therefore, by 


Definition 8.13 the LLS cycle of M is (a1,a2,...,a,). This concludes the proof of 
Item (ii). 


10.1.3 A necessary and sufficient condition for a matrix to be 
reduced 


Now we formulate an important property of reduced matrices that allows us imme- 
diately to decide whether a given matrix is reduced or not. 


Theorem 10.7. A matrix 


in GL(2,Z) is reduced if and only if 
d>b>a>0. 


Example 10.8. For instance, the matrix 


(31) 


is reduced. In fact, this matrix is written as the product of elementary reduced ma- 


trices as follows F j 
37\  /01 : 01 
512) \11 12) ° 


Remark 10.9. In some manuscripts Theorem 10.7 is considered as the main defini- 
tion of reduced matrices. 


Proof of Theorem 10.7. Necessary condition. Let M be a reduced matrix. Then by 
Theorem 10.5 there exist positive integers n, a,...,da, such that 


M= Kyn—2(a2,.--,4n—1) Kyn—1(a2,---;4n) 
Kn—1(41,42,+-+;4n-1) Ky, (a1,42,---;4n) , 


Hence the condition 
d>b>a>0 


is equivalent to the condition 


K,(a1,..-,dn) > Kn—1(a1,---,4n—1) = Kn—2(a2,.--,An—1) = 0. (10.1) 
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The first inequality follows directly from 


Ky(a1,---,4n) = GnKy—1(41,.--,Qn—1) + Kn—2(a1,---;Gn—2) 
> Kn-1(@1,..-,@n—1)- 
Here we have the equality if and only ifn = 1 anda, = 1. 


The second inequality of Inequality 10.1 is proven by analogy (keeping in mind 
symmetry of the continuant for palindromic substitutions). 


Finally, by the construction, the continuants are nonnegative. This concludes the 
proof of the necessary condition of the theorem. 


Sufficient condition. Let us now start with a matrix 
ac 
M — 


d>b>a>0. 


that satisfies the condition 


First of all in the case where a = 0 we immediately have that b = 1. Hence we 
have reduced matrices. 


Now let a > 0. In this situation we consider two different cases with respect to 
the sign of the determinant of the matrix M. 


Case I. Let detM = 1. Then consider the odd continued fraction for b/a: 


b 


— = [aj3a2: +++! dan_1]. 
a 


In particular, we have 
a = Kyy~2(a2,...,42n-1); 
b = Kon-1(a1,42,..-,42n-1)- 


Since detM = 1, there exists a positive integer 7, such that 


C = AKon_2(a2,..-,G2n—1) + Kon-3(@2,-- +, @2n—2)3 
d = AK2n-1(1,42,.-.,@2n—1) + Kon_2(a1,€2,...,@2n-2), 


or, equivalently, 
¢ = Kyn_1(d2,...,42n-1,4);3 
d = Kpy(a1,..-,42n-1,A). 
Hence 
Ma ( _Kon-2(dar-++12n-1)  Kon-1(42,+++,42n-1,4) 
Kon—1(@1,42,.--,42n—1) Kon(ai,..-,@2n-1,4) 


Therefore, by Theorem 10.5 
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Case II. Now let detM = —1. Then consider the even continued fraction for b/a: 
b 
- = [a13a2 ga iehee an| 
a 

and similarly construct A from the condition detM = —1. 


In analogy to Case I, we have 


M=M,,:...-M 


42n 


-M,. 


We leave further details of the proof as an exercise for the reader and conclude Case 
IL. 


Finally it remains to say that for the case d = b > a > 0 the only possible matrix 


(ri) 


which is reduced by definition. This concludes the proof of the sufficient condition 
of the theorem. 


is 


10.1.4 LLS cycles of reduced matrices 


It turns out that it is very simple to write the LLS cycles for reduced matrices with 


given coefficients. 
Corollary 10.10. Let 
ac 
m= (54) 
be a reduced matrix. The LLS cycle for M is 
(a1,d2,-..,An). 


Here n is even for SL(2,Z) matrices and n is odd for GL(2,Z) matrices with nega- 
tive determinant. Then the numbers a\,a2,...,Qy are defined as the elements of the 
regular continued fraction 


ae [4,302 2+++ 2 ay]. 


In addition we have 
M=— Ma, 94J5-++,An* 


Proof. By Theorem 10.5 and Remark 1.18 we have 


d a Kn(a1,42,---,4n) 
c Kyn—1(a2,42,.--,n) 


= [aj;a2 +++: an] 
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hence 
M = May ap,...,an> 


and the LLS cycle for M is (a), a2,...,@n)- 


Example 10.11. Let us write a period of the LLS sequence for the matrix 
3 7 
le € 5 , 


det(M) = 1 


First note that 


and hence we should take the even regular continued fraction. Then 


d 12 
—=— =/1351:2:2I). 
7 (1; ] 


Cc 


Therefore, one of the LLS periods for M is (1,1,2,2). 


10.2 Existence of reduced matrices in every integer conjugacy 
class of GL(2,Z) 


We have the following rather straightforward statement on existence of a reduced 
matrix integer conjugate to a given one. 


Theorem 10.12. Let M be a hyperbolic GL(2,Z) matrix with irrational eigenvalues. 
Then there exists a reduced matrix M’ such that M is integer conjugate either to M' 
or to —M'. 


Proof. Let (a1,...,dn) be the LLS cycle of M. Set M’ = Mg,,...a,- Then by Theo- 
rem 10.5(ii) the reduced matrix M’ has the same LLS cycle as M. Hence by Corol- 
lary 8.16 the matrix M is integer conjugate to either M’ or to —M’. 


Remark 10.13. Reduced matrices are used further in the Gauss Reduction Theory. 
We list all reduced matrices integer conjugate to a given one later in Theorem 11.8. 


We conclude this chapter with the following important result. 


Proposition 10.14. Let M be a hyperbolic GL(2,Z) matrix with irrational eigen- 
values. Then the LLS cycle of M? is a twice repeated sequence of integers. 


Remark 10.15. A non-trivial case here is the case of matrices that have negative 
eigenvalues. Such matrices have odd LLS cycles. 


Proof. First of all let us study the LLS cycles of reduced matrices. Let 
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be a reduced matrix for the sequence of positive integers (a),...,d@,). Then from 
Definition 9.2 we have 


Hence the LLS cycle for M? is twice the LLS cycle for M. 


For an arbitrary M in GL(2,Z) with irrational real eigenvalues it is known that 
M is integer conjugate to +M', where M’ is a reduced matrix (see Theorem 10.12). 
Now recall that the LLS cycles are invariant under both integer conjugation and 
multiplication by —1. Therefore, the period of LLS sequence corresponding to M” 
will be twice the period of the LLS sequence for M. 


10.3 Exercises 


Exercise 10.1. Write the following reduced matrices as products of elementary re- 
duced matrices: 


7 23 513 7 36 
(a) é ae (®) G a (©) & na 


| ® 
Check for 
updates 


Chapter 11 


Continued Fractions and SL(2,Z) Conjugacy 
Classes. Elements of Gauss’s Reduction Theory 


In this chapter we study the structure of the conjugacy classes of GL(2,Z). Recall 
that GL(2,Z) is the group of all invertible matrices with integer coefficients. The 
group GL(2, Z) has another commonly used notation: SL; Z, indicating that all ma- 
trices of the group has determinants equal either to 1 or to —1. 

We say that the matrices A and B from GL(2,Z) are integer conjugate if there 
exists an GL(2,Z) matrix C such that B= CAC™!. 

Notice that in the case of algebraically closed fields (say in C), every matrix 
is conjugate to its Jordan normal form. The situation with GL(n,Z) is much more 
complicated, since Z is not a field. A description of integer conjugacy classes in 
the two-dimensional case is the subject of Gauss’s reduction theory, where conju- 
gacy classes are classified by periods of certain periodic continued fractions (for 
additional information we refer to [83, 143, 113, 145]). We present first steps in the 
study of multidimensional Gauss reduction theory in Chapter 25, in the second part 
of this book. 

In this chapter we discuss the main elements of classical Gauss reduction theory 
based on the theory of geometric continued fractions studied in previous chapters 
(see also [105]). In particular, we discuss some statistical questions and formulate 
several open problems. 


11.1 Conjugacy classes of GL(2, Z) in general 


It is natural to split GL(2, Z) into the following three cases. 

Elliptic case (complex spectra): Consider GL(2,Z) matrices whose characteristic 
polynomials have a pair of complex conjugate roots. There are exactly three integer 
conjugacy classes of such matrices (see Theorem 11.2), represented by 


io) Gio) mt 4): 
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Parabolic case (rational spectra): Let us now study matrices whose characteristic 
polynomials have rational roots. (This contains the degenerate case of two coin- 
ciding roots as in this case that would be a double root of a quadratic polynomial 
with integer coefficients, which actually equal to 1). Such matrices have eigenvalues 
equal to +1, each of these matrices is integer conjugate to exactly one matrix of the 
following families: 


lm 

an for m>0; 
—l—-m 

0 =) for m>0; 


See Theorem 11.6 below (see also in [16]). 


Hyperbolic case (real irrational spectra): The case of hyperbolic matrices with 
real distinct irrational eigenvalues is the most complicated. The conjugacy classes 
of matrices can be represented by reduced matrices introduced in Chapter 10. LLS- 
cycles here will be complete invariants (up to the multiplication by +1). However 
there can be more than one reduced matrix in a conjugacy class; the number of such 
matrices is finite and it is equivalent to the minimal period of the LLS sequence for 
the corresponding to conjugacy class of matrices (see Theorem 11.8). For additional 
information on Gauss Reduction Theory we refer to [83, 143, 113, 145] (including 
questions of matrix similarity in [38)]). 


11.2 Elliptic case 


The matrices that we study in this section are rather important as all of them are 
of finite order. In some sense they are similar to finite order rotation matrices of 
Euclidean geometry. Invariance under the action of such matrices implies symmetry 
properties for the objects. Despite the Euclidean case, the orders of symmetries can 
take only the values 1, 2, 3, 4, and 6. This in particular implies that there are no 
integer regular polygons of orders 5 and n > 7 (see Proposition 2.26). To show this 
we prove the following proposition. 


Proposition 11.1. Let A be a GL(2, Z)-matrix with complex conjugate eigenvalues, 
then either 
Ae=Id, or A*=Id, or A® =Id. 


Proof. First of all note that the characteristic polynomial of A in variable t is 
t” —tr(A)t + det(A) 


(here tr(A) denotes the trace of A) and its discriminant is 


11.2 Elliptic case 117 
tr?(A) —4det(A). 


If detA = —1, then the discriminant is positive, and the eigenvalues are real (we do 
not consider such case). Therefore, detA = 1, and the discriminant is 


te (Aj: 


Since all the elements of A are integers, there are only three values of the trace that 
secure negative values of the discriminant: —1,0, 1. The corresponding characteris- 
tic polynomials are respectively factors of the polynomials 


x1 = 1s. “and Si, 


2 % 


Therefore, all the eigenvalues are the corresponding roots of the unity. Since matrix 
A is real and its eigenvalues are complex conjugate, the eigenvalues are distinct to 
each other. This implies that matrix A is diagonalizable in some complex basis, and, 
therefore, either A?, or A*, or A® are identity matrices. 


Let us prove the classification theorem now. 


Theorem 11.2. Any GL(2, Z)-matrix with complex conjugate eigenvalues is integer 
conjugate to precisely one of the following three matrices: 


(Vi) (Go)s ee Ga): 


Proof. Consider a GL(2,Z)-matrix A whose eigenvalues are complex conjugate 
numbers. Then the level sets of the MD-characteristic for A are ellipses. 


Consider an integer point v with minimal possible value of the MD-characteristic. 
This point will be on the ellipse that does not contain integer points in its interior 
except for the origin. By Proposition 7.11 A(v) is also on this ellipse, and hence the 
triangle with vertices O, v, and A(v) is empty. 


Hence the vectors v and A(v) generate Z”. Then in the basis (v,A(v)) the matrix 
A will be of the form 
Om 
ln}- 


Firstly, since detA = 1, we have m = —1. 
Secondly, since trA € {—1,0,1}, we have n € {—1,0,1}. 
Therefore, A is integer conjugate to precisely one of the following three matrices: 


(Vai)) Go)» a (13): 


This concludes the proof. 
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Remark 11.3. Assume we are given a GL(2,Z) matrix M with complex eigenvalues. 
Then it is very simple to determine which of one of the three above matrices is 
integer conjugate to M. We should compute the trace of M. It could be either —1, or 
0, or | which will completely determine the integer conjugacy class of the matrix. 


11.3 Parabolic case 


In this subsection we study conjugacy classes of matrices that have a rational eigen- 
value. Let us start with the following preliminary proposition. 


Proposition 11.4. Let M be a GL(2,Z) matrix with a rational eigenvalue. Then the 
following hold. 

(i) All the eigenspaces of M are integer, namely, they contain integer points. 

(ii) There exists an integer m such that the matrix M is integer conjugate to either 


(oi) 0%) # G4) 


Remark 11.5. In particular, if a GL(2, Z)-matrix M has a rational eigenvalue, then 
both its eigenvalues are in the set {—1, 1}. 


Proof. Item (i). This follows directly from the standard procedure of finding eigen- 
vectors. We leave this as an easy exercise on linear algebra. 


Item (ii). Assume M has a rational eigenvector. If M is a scalar multiplication ma- 
trix, then the fact that | det M| = 1 implies that M is +Id. In the case where M is not a 
scalar multiplication matrix, there exists a one-dimensional eigenspace correspond- 
ing to the rational value, that is integer by Item (i). Let v be an elementary integer 
vector (i.e., with relatively prime coordinates) of this eigenspace. Now consider any 
integer basis of the lattice Z”, that contains v as the first element of the basis. In this 
basis the matrix M has a form 
lm 
(on): 


l=+1 and n=l. 


Since M € GL(2,Z), we have 


Therefore, the eigenvalues of M are either 1 or —1. In the case where M has an 
eigenvalue 1, from the above we have the following: M is integer conjugate to either 


oi) * 6%) 


Finally if both of the eigenvalues equal —1, then M is integer conjugate to 
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—lm 
0 -1)° 


The classification theorem in the parabolic case of rational spectra matrices is as 
follows. 


This concludes the proof. 


Theorem 11.6. Let M be a GL(2,Z) matrix with a rational eigenvalue. Then M is 
integer conjugate to a precisely one matrix of the following list: 


¢ SL(2,Z) matrices ({ “ , where m € Z50; 


¢ SL(2,Z) matrices & ae , where m € Zs0; 
° finally GL(2,Z) \ SL(2,Z) is represented by the following two matrices 


A) a (4) 


First of all we prove the following lemma. 


Lemma 11.7. Two matrices 


are integer conjugate if and only if my = =m. 
The proof of this lemma is a nice exercise in linear algebra. 


Proof. Recall that M, and Mp) are integer conjugate if and only if there exists a 
GL(2,Z) matrix A such that 
M, =AM>A~! 


or, equivalently, 
M\A =AM). 


a= (35): 


lm fac\_ a+mbc+mid 
0 1 bd b d 


ac\- lm2\ — famatc 
bd 01) \bmb-+d)° 


If the case where the above two matrices coincide we have 


Assume that 


Then we have 


o) 


a=a+mb 
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(here we compare the top diagonal element of the matrix). Therefore, 
mb =0. 


If m, = 0, then M is the identity matrix and it is not (integer) conjugate to any other 
matrix. If m; #0 then b = 0, and hence 


a=+l and d=. 


Now let us compare the top right elements of the matrices. We have 
c+md=ma+tc. 


The last is equivalent to 
md = moa. 


Since a,d = +1 we have 


m, ==mM). 


So if the absolute values of m, and mp2 are distinct the matrices M; and M2 are 
not integer conjugate. 


Finally it remains to show that if m, = —mz then M, is integer conjugate to M2. 


Indeed, we have 
l1—-m\ /10 lm 10 
01) \O-1 01 0O-1/)° 


This concludes the proof for the case of two unit eigenvalues. 


Proof of Theorem 11.6. Let us study first the case when both eigenvalues of M equal 
1. Then by Proposition 11.4 the matrix M is integer conjugate to 


(0%) 


and hence we get the matrices of the first item in Theorem 11.6 by Lemma 11.6. 


The case of two eigenvalues equal to —1 is similar to the above one. Here we 
have a similar classification after multiplying matrices by —1. 


Finally let one of the eigenvalues of M equal | and the other equal — 1. By Propo- 
sition 11.4 the matrix M is equivalent to 


(0) 


for some integer m. The last matrix has two eigenvectors: 


v= (1,0) and w = (—n,2). 
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Fig. 11.1 Two options for the case of distinct rational eigenvalues: LLS = (1) (Left) and LLS = (2) 
(Right). 


Now two matrices with rational eigenvectors are equivalent if and only if the LLS 
sequences of the angles with edges containing eigenvectors coincide. In our case we 
have two different cases here: 


* the LLS sequence is equal to (1) when 7 is even; 
* the LLS sequence is equal to (2) when n is odd. 


(See Fig. 11.1.) 
The integer conjugacy classes of these two cases of matrices are represented by 


(5) (4) 


This concludes the proof for the last case of eigenvalues of different signs. 


11.4 Hyperbolic case 


In this section we study conjugacy classes of GL(2, Z)-matrices with real irrational 
eigenvalues. In particular we show that LLS cycles are complete invariants of the 
conjugacy classes and list all reduced matrices integer conjugate to a given one. As 
an important consequence we get the statement on algebraicity of periodic sails. 


11.4.1 The set of reduced matrices integer conjugate to a given one 


The following structural theorem provides description of conjugacy classes in 
GL(2,Z). 
Theorem 11.8. Let M be a hyperbolic GL(2,Z) matrix with irrational eigenvalues 
and let 

(a1,.--,4n) 
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be its LLS cycle. Finally let m be the minimal length of the period of the LLS cycle. 
Then the list of all reduced matrices integer conjugate simultaneously either to M 
or to —M consists of m matrices of the form 


fork=1,...,m. 
(Here we define indices cyclically modulo n.) 


Proof. We know that two matrices have the same LLS sequences if and only if 
their unions of eigenlines are integer congruent to each other. Hence M could be 
conjugate only to reduced matrices commuting with 


fork =1,...,m. 


(These are the only matrices that have such LLS sequences.) Such matrices are some 
powers of these matrices. 

Finally the shift of the LLS sequence of M? by n vertices uniquely determines 
the reduced matrices. 


11.4.2 Complete invariant of integer conjugacy classes 


Theorem 11.8 implies the following fundamental result. 


Corollary 11.9. (On complete invariant of integer conjugacy classes.) LLS cycles 
are complete invariants of conjugacy classes of PGL(2,Z) matrices with distinct 
positive real eigenvalues (i.e., up to the multiplication by —1). Namely, we have the 
following two statements: 

(i) If matrices M, and My have the same LLS cycles, then M, is integer conjugate 
to either M2 or —M). 

(ii) Any arbitrary cyclic sequence is realised as the LLS cycle for some integer 
conjugacy class of hyperbolic GL(2,Z) matrices with irrational eigenvalues. 


11.4.3 Algebraicity of matrices with periodic LLS sequences 


We conclude this section with a general statement on angles with periodic LLS 
sequences. 


Corollary 11.10. A sail with a periodic LLS sequence is algebraic (i.e., a sail of 
some hyperbolic matrix in GL(2,Z)). 


Proof. From Corollary 11.9 we have reduced matrices for all finite sequences as pe- 
riods. Then in Proposition 8.9 we showed that sails with equivalent LLS sequences 
are either integer congruent or dual. Therefore, any sail with a periodic LLS se- 
quence is algebraic. 
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11.5 Computation of all reduced matrices integer conjugate to a 
given one 


11.5.1 Explicit computation via LLS cycles 


Remark 11.11. Theorem 11.8 provides technique to compute all reduced matrices 
integer conjugate to a given matrix M. Here one could use the algorithm to construct 
the LLS cycles for matrices (see Sections 8.6). After the LLS cycle for a given 
matrix is found, say it is 

(a1,---,4n), 
and let m be its minimal period, we can then write down the reduced matrices in 


accordance with Theorem 10.5. All the reduced matrices conjugate to M will be of 
the form 


RpatlGin Git) Ret (GiG 6025 Geen) 
Kn—1(Ak-415 Qk+25+ ++, Ak+n—1) Kn(Gg41,4k425+++5Ak-+n) 


here k =0,...,m—1. 


We continue with the following example. 


Example 11.12. Consider 


1519 1164 
te (ioe oo) 


The LLS cycle of M is (1,2,1,2). (Here we omit the computation of the LLS cy- 
cle for M, they are done following the techniques discussed in Section 8.6. The 
computations are left as an exercise for the reader.) Hence there are exactly two re- 
duced matrices represented by the sequences (1,2,1,2) and (2,1,2,1). For the case 


of (1,2,1,2) we have 
K2(2,1)  K3(2,1,2) 3 8 
K3(1,2,1) K4(1,2, 1,2) 411)? 


and for the case of (2, 1,2, 1) we have 


(mind) cieizn) (sin): 


So the set of reduced matrices integer conjugate to M is 


(Gn): (sn)} 
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11.5.2 Algorithmic computation: Gauss Reduction theory 


In this subsection we are aiming to construct a reduced matrix integer conjugate to 
a given one using the algorithm similar to the Euclidean algorithm. Such algorithms 
are sometimes treated as Gauss reduction algorithms. 


For the remainder of this section we denote by [[p,r][q,5]| the matrix 


pr 
qsj- 
Algorithm to construct reduced matrices. 


Input data. We are given an SL(2,Z) matrix M = [[p,r][g,s]]. In addition, we sup- 
pose that the characteristic polynomial of the matrix is irreducible over Q (or, equiv- 
alently, that it does not have +1 as roots) and has two positive real roots. 


Goal of the algorithm. To construct one of the periods of the LLS sequence for M. 
Step 1. If g < 0, then we multiply the matrix [[p,r][g,s]] by —Id. Go to Step 2. 


Step 2. We have g > 0. After conjugation of the matrix [[p,r][q,s]] by the ma- 
trix [[1,—|p/q]][0,1]] we get the matrix [[p’,r’][q',s’]], where 0 < p’ <q’. Go to 
Step 3.1. 

Step 3.1. Suppose that g’ = 1. Then p’ = 0, r’ = —1. In addition we have |s’| > 2, 
otherwise, the matrix has either complex roots or rational roots. The algorithm stops, 
and the output of the algorithm is the matrix [[0,—1][1,.s’]]. 


Step 3.2.1. Suppose that g’ > 1 and s’ > q’. Then the algorithm stops, and the output 
of the algorithm is the matrix [[p’,r’][q’,s‘]]. 


Step 3.2.2. Suppose that g' > 1 and s’ < —gq’. Conjugate the matrix [[p’,r’][q',s’]] 
by the matrix [[—1,1][0,1]] and multiply by —Id. As a result we have the matrix 
[pr \[q",s”]| with q” =’, p’ =q' —p’, ands" = —q' —s' > 0. Go to Step 3.2.1 
or to Step 3.2.3 depending on s” and q”. 


Step 3.2.3. Suppose that q’ > 1 and |s’| < |q’|. Notice that the absolute values of q’ 
and s’ do not coincide, since the matrix has unit determinant. Hence |s| < |q|. Then 
we have 


1. 


of] | 2 1 
n= | (q es <q' 
q q 
Go to Step 1 with the matrix [[s’, g’][r’, p’]] with |r’| < |q’|. This is the matrix obtained 
from [[p’, r’][q’, s’]] by conjugation with [[0, —1][—1,0]]. 


E 


Output. The reduced integer matrix that is conjugate to +M. The sign is defined by 
the sign of the trace of the original matrix (i.e., of p+). 

Remark 11.13. During the algorithm we only use conjugations with GL(2,Z) ma- 
trices and multiply by —Id. So in the output one should expect a reduced integer 
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matrix that is conjugate either to M or to —M. Nevertheless, since M, as well as 
any reduced matrix, has all positive eigenvalues, the resulting matrix is conjugate 
exactly to +M. 


Remark 11.14. The algorithm works in a finite number of steps, since after each 
iteration the integer absolute value of g is decreases. 


11.6 Statistical properties of reduced SL(2, Z) matrices 


Without loss of generality in this section we restrict to the case of SL(2,Z) ma- 
trices. Note that there are many different settings to study statistical properties of 
conjugacy classes of SL(2,Z) matrices. For instance in [40] the conjugacy classes 
of elements of SL(2,Z) with given trace are counted. Here we approach the question 
via complexity of reduced operators in the conjugacy classes. 


11.6.1 Complexity of reduced matrices 


As we have already seen, in the majority of conjugacy classes there is more than one 
reduced matrix. Then it is natural to introduce some additional notion of complexity 
to reduce the number of the reduced matrices. Let us introduce one important notion 
of complexity, whose extended definition we will use later in Chapter 25 in the 
multidimensional case. 


Definition 11.15. Let M = [[p,r][g,s]] be a reduced matrix. A ¢-complexity of M is 
the number g. We denote it by ¢(M). 


A geometric interpretation of ¢(M) is as the integer area of the triangle with 
vertices O(0,0), v = (1,0), and M(v). 


Problem 4. Find a reduced matrix with minimal ¢-complexity from a given conju- 
gacy class. 


Remark. The minimal ¢-complexity coincides with the minimal value of Isin POQ, 
where P is an arbitrary integer point distinct from O, and Q = A(P). 

If the LLS cycle of M is (a),...,@2n), then the above problem is equivalent to 
finding the minimal numerator among the numerators of the rational numbers: 


[a13+++2n—1],  [a2s--+ 2 Gon], [a3s--s ton tail, ...  , [@ansay 2 +++ taza]. 


Example 11.16. For instance, if the LLS cycle of M is (a,b), where a < b, then the 
minimum for the numerators is a. 


Example 11.17. Let the minimal period of the LLS sequence consists of three ele- 
ments: (a,b,c,a,b,c). The numerator of the fraction [a;b : c : a: b] is minimal if and 
only if c > a,b. 
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The following example shows that it is not correct to skip the maximal entry of 
the sequence. 


Example 11.18. Consider the LLS cycle (1,4,5,4,1,4). The minimum of the nu- 
merators is attained at the fraction [1;4: 5:4: 1], and not at the fraction [4;1:4:1: 
4]. 


11.6.2 Frequencies of reduced matrices 


We start with a proper probabilistic space. Let P = (a1,a2,...,@2n—1,42n) be an 
arbitrary sequence of positive integer elements. Denote by #,(P) the number of 
matrices M satisfying the following: 

(i). The absolute value of every entry of M does not exceed N. 

(ii). The sequence P is the LLS cycle of M. 

(iii). The algorithm of the previous subsection with M as an input produces the 
reduced matrix [[p,r][g,s]] with (p,q) = (0, 1) for the case of P = (1,az), and 


q/p = |ai3a2 2 +++! dmn—1] 


for all the remaining cases. 


Let us formulate several pertinent open questions. 


Problem 5. (i) Which reduced matrix with a given trace (or with a given LLS 
sequence) is the most frequent as N tends to infinity? 

(ii) What is the relative probabilities of two sequences representing the same LLS 
cycle as N tends to infinity? 

(iii) Is it true that the reduced matrix with minimal ¢-complexity is the most frequent 
among the reduced matrices of a given LLS cycle? 


In Table 11.6.2 we show some values of #25900(P) for matrices with absolute 
value of the trace less than 11. 

Observe that SL(2,Z) matrices corresponding to the sequence (1,2) are more 
frequent than those corresponding to (1,1). This is due to the fact that matrices 
with LLS cycles (1,1) have integer congruent dual sails. One may enumerate the 
matrices with multiplicities equivalent to the number of integer congruent sails and 
obtain 

4#25000(1,1) > 2#25000(1,2) + 2#25000(2, 1). 


We conclude with the following question. Denote by GK(P) the frequency of a 


sequence P = (d1,d2,...,@2,_1) in the sense of Gauss—Kuzmin: 
1 (a +1)a 
GK(P) = In 
(P) In(2) ao 


where 
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Absolute value] Notation for classes | Period Matrix Value of 
of the trace |of equivalent matrices P P,T\[4;5]|  |#2s5000(P) 
3 L3 1,1 0, 1][—1,3]] | 663160 
4 Ly 1,2 0, 1][—1,4]] | 834328 
2,1 1,2)[1,3 304776 

5 Ls 1,3 0, 1][—1,5]] | 818200 
3,1 1,3][1,4 194528 

6 Leo 1,4 0, 1][—1,6]] | 777128 
4,1 1,4)[1,5 141784 

L62 2,2 1,2]|2,5 446432 

7 L1 1,5 0, 1][—1,7]] | 734904 
5,1 1,5][1,6 110848 

142 (1,1,1,1)) ][2,3][3,5 201744 

8 Lg 1,6 0, 1][—1,8]] | 695560 
6,1 1,6][1,7 90688 

Lg2 253 1,2][3,7 435472 

3,2 1,3][2,7 310872 

9 Lo 1,7 0, 1][—1,9]] | 660984 
7,1 1,7][1,8 76552 

10 L10,1 1,8 [[0, 1][—1, 10]]] 630592 
8,1 1,8][1,9 66064 

Li02 2,4 1,2]/4,9 408216 

4,2 1,4][2,9 239712 

L103 (1,1,1,2)]) ][2,3][5,8 260872 

(2,1,1,1)} [[2,5][3,8 114084 

(1,2,1,1)| [[3,4][5,7 149832 

(1,1,2,1)| [[3,5][4,7 114084 


Table 11.1 Values of #25099(P) for matrices with small absolute value of the trace. 


Ot) = [ay3a2 +++! A2n—2 ! A2n-1] and On = [ay3a2 2 +++! don—2 2 G2n—1 +1]. 
Problem 6. Let 


Pi = (@1,@2,..-42n—1,42n), Pi = (41,42,...2n—1), 
Px = (a2,43,..-€2n,41), Py = (a2,43,...@2n)- 


Ts the following true: 
#n(P GK(P’ 
ne #,(P2) GK(P4) 


For further references to Gauss—Kuzmin statistics, see Section 13. 


11.7 Exercises 


Exercise 11.1. Is it true that two SL(2,R) matrices are integer conjugate if and only 
if their geometric continued fractions are integer congruent? 


128 11 Elements of Gauss Reduction Theory 
Exercise 11.2. Consider a hyperbolic SL(2,Z) matrix. Let its eigen lines be y = 
2x. Then a; and Q are conjugate quadratic irrationalities. 


Exercise 11.3. Let A be a hyperbolic (2 x 2) matrix with irrational eigenvalues. 
Find all the matrices commuting with A. 


Exercise 11.4. Find all reduced matrices for the matrix 
103 69 
100 67) © 


Exercise 11.5. Prove Proposition 11.4(i). 


Check for 
updates | 


Chapter 12 
Lagrange’s Theorem 


The aim of this chapter is to study questions related to the periodicity of geomet- 
ric and regular continued fractions. The main object here is to prove Lagrange’s 
theorem stating that every quadratic irrationality has a periodic continued fraction, 
conversely that every periodic continued fraction is a quadratic irrationality. One of 
the ingredients to the proof of Lagrange theorem is the classical theorem on integer 
solutions of Pell’s equation 

nm — dn? =1. 


So, there is a strong relation between periodic fractions and quadratic irrationalities. 
It is natural to ask what happens in cases of cubic, quartic, etc., irrationalities? How 
can one generalize Lagrange’s theorem to the multidimensional case? We give a 
partial answer to such questions in Chapter 22. 

We start in Section 12.1 with the study of so-called Dirichlet groups, which are 
the subgroups of GL(2, Z) preserving certain pairs of lines. These groups are closely 
related to the periodicity of sails. The structure of a Dirichlet group is induced by 
the structure of the group of units in orders (we will discuss this later in more de- 
tail for the multidimensional case in Chapter 21); here we restrict ourselves to the 
simplest two-dimensional case. In Section 12.2 we take a break and show how to 
take nth roots of matrices using Gauss’s reduction theory. In Section 12.3 we study 
the solutions of Pell’s equation. And finally, in Section 12.4 we prove Lagrange’s 
theorem. 


12.1 The Dirichlet group 


Let A be a GL(2,Z) matrix with two distinct eigenvalues. Denote the set of all 
GL(2,Z) matrices commuting with A by £(A), ie., 


=(A) = {B € GL(2,Z) | AB = BA}. 


© Springer-Verlag GmbH Germany, part of Springer Nature 2022 129 
O.N. Karpenkov, Geometry of Continued Fractions, 

Algorithms and Computation in Mathematics 26, 

https://doi.org/10.1007/978-3-662-65277-0_12 


130 12 Lagrange’s Theorem 


The set (A) is a group under the operation of matrix multiplication. We call it 
the Dirichlet group in dimension 2. We study the multidimensional case later, in 
Chapter 21. 

In case of A with only real eigenvalues, we are interested in the subgroup of the 
Dirichlet group = (A) containing all matrices with positive eigenvalues. This group 
is called the positive Dirichlet group and denoted by =, (A). 


Definition 12.1. We say that two Dirichlet groups (or positive Dirichlet groups) =) 
and > are integer conjugate if there exists B € GL(2, Z) such that 


A—B!AB 


is an isomorphism of © and 2. 


Proposition 12.2. Two Dirichlet groups (or positive Dirichlet groups) =, and 
=. are integer congruent if there exist A € = with distinct eigenvalues and B € 
GL(2,Z) such that B~'AB € 2». 


Let us study the structure of Dirichlet groups for the matrices of GL(2,Z). There 
are three essentially different cases here (this is similar to the complex, rational 
and real irrational cases of Section 11). The first case is for elliptic matrices. In 
the second and the third cases we have hyperbolic matrices, i.e., matrices all of 
whose eigenvalues are real. In the second case we consider matrices with rational 
eigenvalues (parabolic case): the eigenvalues of such GL(2,Z) matrices are +1. In 
the third case we study matrices with irrational eigenvalues (hyperbolic case): the 


eigenvalues of such matrices are quadratic irrationalities (i.e., ee where a,b,d 
are integers and c > | is a square-free integer). 


Remark 12.3. Since conjugate matrices have isomorphic Dirichlet groups, it is 
enough to study only one representative of each conjugacy class of matrices in 
GL(2,Z). 

Recall also that the Dirichlet groups are defined only for matrices with distinct 
eigenvalues, so we will consider only matrices with distinct eigenvalues. 


Elliptic case. There are exactly two Dirichlet groups of such type up to conjugation. 
The representative matrices of these groups can be chosen from the group SL(2, Z). 
We have already seen them in Chapter 11: 


Gas 


The first matrix is the matrix of rotation by the angle 7/2. Only its powers commute 
with it, and hence the Dirichlet group is isomorphic to Z/4Z. The second matrix 
represents the 6-symmetry of a lattice. The related Dirichlet group is isomorphic to 
Z/6Z. 


Real spectra case I: rational eigenvalues (parabolic matrices). Here we have two 
conjugate classes of matrices represented by 
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10 ees 
amo): 4-071) 


The Dirichlet group for £(A;) is isomorphic to Z/2Z ® Z/2Z, and it is generated 
by +A; (for i = 1,2 respectively). 


Real spectra case II: irrational eigenvalues (hyperbolic matrices). This is the 
largest set of irreducible matrices whose conjugacy classes are described via Gauss’s 
reduction theory. We start with a GL(2,Z) matrix A with distinct irrational eigen- 
values (as we have already mentioned, the eigenvalues are conjugate quadratic ir- 
rationalities). First, note that A? is in SL(2,Z) and has distinct eigenvalues. Hence 
without loss of generality we restrict ourselves to SL(2,Z) matrices. Such matrices 
have periodic geometric continued fractions with periodic LLS sequences. Due to 
Gauss’s reduction theory and Proposition 8.9 there exists a one-to-one correspon- 
dence between integer conjugacy classes of Dirichlet groups and minimal periods 
of LLS sequences. 


Theorem 12.4. Consider an irreducible hyperbolic matrix A € SL(2,Z). Then its 
Dirichlet group © (A) is isomorphic to Z@ Z/2Z. 


Proof. The proof is straightforward. Any matrix commuting with A has the same 
eigenlines, so it either acts as a shift on the sail, or exchanges the sails. The group 
of shifts is homeomorphic to Z. Suppose it is generated by A. If the period is even, 
then the dual sails are not congruent, and the group is Z@Z/2Z generated by A 
and —Id. If the period of the LLS sequence of the sail is odd, then one can extract 
a square root of A in SL(2,Z). This square root will exchange dual sails, which are 
congruent in this case. The Dirichlet group is again Z @ Z/2Z, and it is generated 
by A!/? and —Id. 


Remark 12.5. Let us explain how to find generators of a two-dimensional Dirichlet 
group =(A) for a hyperbolic matrix with irrational eigenvalues. Take A”. This matrix 
has all positive eigenvalues, and therefore, it preserves every sail. Construct a sail of 
A’. The matrix A” defines a shift of the LLS sequence and hence one of its periods. 
Let this period be n times the minimal period. Then the second generator of 2 (A) is 
(A?) '/n (which is easily constructed using a Jordan basis for A in which the matrix 
is diagonal). 


Corollary 12.6. Consider an irreducible hyperbolic matrix A € SL(2,Z). Then its 
positive Dirichlet group =4(A) is homeomorphic to Z. 


Example 12.7. Consider the reduced matrix 
Ae 11 30\ fil 842-11 
“AIS 417 ~ \15 1142-15)" 
The period of the LLS sequence related to this continued fraction is (1,2,1,2,1,2). 


So there exists a cube root of A in SL(2,Z). This cube root is also reduced and has 
period of LLS sequence (1,2). Hence by Theorem 11.8 it is 
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10+421\ 12 
1 1421) ~ \13)° 


Denote this matrix by A. The group (A) is generated by A and —Id. 


Test for two Hyperbolic Dirichlet groups to be integer conjugate. Consider two 
GL(2,Z) matrices A and B with real irrational eigenvalues. We would like to check if 
they generate integer conjugate Dirichlet groups (i.e., that all elements of one group 
are integer conjugate to the elements of the other). This is similar to checking if the 
arrangements of eigenlines generated by matrices A and B are integer congruent to 
each other. 


Equivalently, we can write as follows. Consider two GL(2,Z) matrices A and B 
with real irrational eigenvalues. Do there exist integers n,m such that A” is integer 
conjugate to B”. 


In order to answer such questions one should compute periods of LLS sequences 
for A and B and find minimal periods for them (e.g., this could be done using the 
result of Theorem 8.27). These periods are the same up to a cyclic shift and possibly 
a change of the order of all the elements if and only if the corresponding cones are 
integer congruent to each other. 


12.2 Construction of the integer nth root of a GL(2, Z) matrix 


In this section we study how to check whether a certain matrix has roots that are 
also integer matrices. We start with SL(2,Z) matrices and further say a few words 
about GL(2,Z) case. 


Algorithm to construct an integer nth root of a GL(2,Z) matrix. 


Data. Suppose that we are given an SL(2,Z) matrix A = [[p,r][q,s]] with positive 
discriminant (this matrix has real spectrum with irrational eigenvalues) and a posi- 
tive integer n. 


Algorithm goals. To verify whether there exists and if so to find a matrix B € 
GL(2,Z) such that A = B”. 


Step 1. First we find one of the reduced matrices Az = [[p2,r2][qG2,52]] that is con- 
gruent to A and remember the congruence matrix C (i.e., Ay = CAC™!). 


Step 2. By Theorem 11.8 we have the formulas to write the LLS periods for A. 
There exists A3 = (A2)!/” that belongs to GL(2,Z) if and only if the constructed 
LLS periods have subperiods whose lengths are | /n that of Ao. 


Step 3. Suppose that the period repeats the sequence (a1,...,a,) n times. Then the 
matrix A3 is constructed by the formula A3 = [[p3,7r3][q3,53]], where p3 and q3 are 
relatively prime positive numbers defined by 
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93 
——— [a3+++ Qg-1 E 
P3 


The remaining two coefficients are 


_ Brn _ g2p3+(s2— p2)q3 
[33 ay s3 = 6 
q2 q2 


Since q2 > 0, all the coefficients are well defined. 


Output. We have verified that an nth root exists. If it exists, the resulting matrix B 
of this root is as follows: 
B=C'A3C. 


Proposition 12.8. The matrix A3 constructed in Step 3 is an nth root of the matrix 
Ap. 


Proof. There exists a matrix M € =(Az) mapping (1,0) to (a3,b3). This matrix 
corresponds to the sail integer self-congruence related to the shift for the period 
(a1,...,ax) if k is even. If k is odd, we have an integer congruence of dual sails. In 
both cases the matrix M is the nth root of A contained in = (A2). 

Since M(1,0) = (43,53), the first row of M coincides with the first row of A3. 
The coefficients in the second rows of A3 and M are uniquely defined by the fact 
that AxM@ = MA» and A3M = MA3. The formulas are obtained from these matrix 
equations (which are four equations in the elements). Hence A3 = M € 3(Az2) C 
GL(2,Z), and it is constructed by the formulas in the algorithm. 


Remark 12.9. Now let A € GL(2,Z) and det(A) = —1. It is clear that A does not 
have integer even roots. Suppose n is odd. Then to find the nth root one should 
check the (2n)th root for the matrix A? in SL(2,Z). The matrix (A7)!/2” (if it is in 
GL(2,Z)) is one of the nth roots of A. 


12.3 Pell’s equation 


Before studying Lagrange’s theorem, we focus on the classical Pell’s equation, 
which we will use in the proof of Lagrange theorem. Pell’s equation is a Diophan- 
tine equation of the form 

nm? — dn? = 1, 


where n and m are integer variables and d is not the square of an integer. Pell’s 
equation has trivial solutions (1,0). The problem of finding nontrivial solutions of 
Pell’s equation was posed by P. Fermat in 1657. The solution of Pell’s equation was 
mistakenly attributed to J. Pell. In fact, W. Brouncker found a method to solve this 
equation. The first to publish a strict proof was J.-L. Lagrange, in 1768 (see [140]). 
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Theorem 12.10. Let d be a nonsquare integer. Then Pell’s equation 
m —dn* =1 


has positive solutions. In addition, there exists a positive solution (m,,n,) such that 
the set of all positive solutions coincides with the set of pairs (mz,n,) with positive 
integer parameter k defined as follows: 


me\ —_ (m, dn, f 1 

Nk - ny Mm, 0}° 
Proof. Existence of a nonnegative solution. By Theorem 1.22 there are infinitely 
many integer solutions of the inequality 


1 
vd - 2 a 
| q|  V5q? 
Letting (p,q) be any of them, then 


2Vd+1 
lg-d — p"| = |avd — pl|gvd + p| = va —F | [pva+ (va-F)\¢ oF at 


Therefore, there exists an integer c such that the equation 
qd—p*=c 


has infinitely many integer solutions. Choose among these solutions (m,n) and 
(m2,n2) such that 


m, = m(modc) and ny =nz(modc). 


(This is possible, since the number of distinct remainder pairs is finite.) 
Now take 
mim — dnin2 mon, —dmyn2 


m= ———_——_ and ni 
Cc Cc 


Notice that 

ne?) Po = a = 

m = mj — dn, =c =0(modc) and A = myn, — myn; = 0(modc). 
Hence the point (7,7) is integer. Now let us consider 


m,—Vdn, m,+Vdn = mi, — dnt ec 


n2 nm? ~ A\ (A A 
fi — di? = (m—Vda)(m+Vda) = = 
( M ) my — Vdno m) +VJdn m — dn Cc 


= 1. 


(The second equality is established by direct calculations.) This concludes the proof 
of the existence of nonnegative solutions. 
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Structure of the set of solutions. Let (m,n) be a positive solution of Pell’s equation. 


Then 
m 1 m dn 
4 =Amn (4) , Where Any = (" ) . 


Notice that Aj, is an integer matrix with characteristic polynomial t? —2mt +1 in 
variable t. Hence, Am» is in SL(n,Z), and all its eigenvalues are distinct positive 
real numbers. 

Consider an angle @ /7 with vertex at the origin and edges y = +x/ Vd for x > 0. 
Let us show that Aj, preserves Va" 

The union of lines containing 0 7 is defined by the equation 


x —dy’ =0. 


Let us find the image of this union with respect to the action of A». The equation 
for the image is 
(mx + dny)* — d(nx+my)* =0, 


which is equivalent (after simplification) to the initial equation 
x’ —dy’ =0. 


Hence A», preserves the union of two lines. There are two possible situations here: 
either these lines are invariant lines of Aj», Or Amn interchanges the lines. Since 
points with positive coordinates are taken to points with positive coordinates, the 
lines are eigenlines. Finally, since the eigenvalues are positive real numbers, the 
rays defining the angle @ ,7 are preserved. 

So the matrix Aj, is in the positive Dirichlet group ©, preserving the angle a 7. 
It is easy to see that the point (1,0) is on the sail of a Vg (it is the only point with 
x <1 in the angle). 

We have shown that every positive solution of Pell’s equation is defined by a shift 
of a sail by a matrix of the positive Dirichlet group 21. From the other point of view, 
every matrix of the Dirichlet group preserves the value x” — dy? (which is 1 in our 
case). Recall that #4 is isomorphic to Z. Letting A generate this group (we choose 
A such that it takes (1,0) to the half-plane y > 0), then all the solutions with positive 
x-coordinate are of the form A*(1,0). We choose only a positive parameter k, in 
order to have positive y-coordinates of the solutions. Finally, denoting by (m,n) 
the point A(1,0), then the matrix 

m, dn, 
(nm 


is in ¥, and defines the same shift of the sail (as A), hence equaling A. This con- 
cludes the proof of the second statement of the theorem. 
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12.4 Periodic continued fractions and quadratic irrationalities 


A continued fraction [ag;a1 : ---] is called periodic if there exist positive integers ko 
and h such that for every k > ko, 


Ak+h = Ag. 


We denote it by [ao3a1 +++! Ak! (Aky41 i °** 1 Aky +h): 
Theorem 12.11. (Lagrange.) Every periodic regular continued fraction is a quad- 


ratic irrationality (i.e., a+ OVE for some integers a, b, c, and d, where b#0,c>1, 


d > 0, and c is square-free). The converse is also true: every quadratic irrationality 
has a periodic regular continued fraction. 


Lemma 12.12. For every quadratic irrationality & there exists an SL(2,Z) matrix 
such that one of its eigenvectors is (1,€). 


Proof. Let & be a root of the equation crx + c)x + co = 0 with integer coefficients 
Co, C1, C2. Consider an arbitrary matrix 


aij a 
A- 11 412 ; 
421 422 


(: ayn —ayy+/(arn seni) 


2a}2 


Its eigenvectors are 


Notice that € and its conjugate root are of the form 


—cy+, pee —4coc2 


2c2 


So the matrix A has a root (1,6) if the following system is satisfied: 


nco = —421, 
nc, = a\1 — 422, 
ncz = a)2, 


for some n £0. 
Let us find a matrix of SL(2,Z) satisfying this system for some integer n. Since 
n is an integer, the coefficients a; and a2, are integers as well. Since detA = 1, we 
have 
detA = a11d22 — a12a21 = ay1(na\1 — C1) +ncocr = 1. 


Therefore, 
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ney Meas —4Acoc2) —4 
5 ; 
The coefficient aj; is an integer if and only if there exists an integer m satisfying 


ay1= 


m? =n? (ci —4coc2) — 4. 
Set D = ci} —4coc2, m! = m/2, and n! = n/2 and rewrite the equation 
m? — Dn? =1. 


We end up with Pell’s equation. Since € is irrational, the discriminant D = G —Acoc2 
is not the square of some integer (since € is real, D > 0). Hence, by Theorem 12.10, 
it has an integer solution (m,n) with nj # 0. Hence the matrix 


/ / / 
My —NgC1 2Nyc2 
—2npco my +npc1 


has integer elements and unit determinant. Therefore, it is in SL(2,Z). 


Proof of Lagrange’s theorem. First, let us show that every periodic continued frac- 
tion represents a quadratic irrationality. Since the continued fraction is infinite, the 
corresponding number is irrational. 

Suppose that the periodic continued fraction for € does not have a preperiod, i.e., 


& = [(ao;ay 3 +++: dy)). 


Then Z 
+ Pn-1 
E =agsay +++ ta, 2) = PnS ¥ Pn-1 
Gus Tr n-1 
(The last equality holds by Theorem 1.13.) Notice that the denominator g,_1& + 
Gn—2 iS nonzero, since & is irrational. Therefore & satisfies 


Qnie + (dn—1 — pn-1)§ — Pn-2 = 9. 


Hence € is a quadratic irrationality. 
Suppose now that 


& = [ao3a) 2 +++ Gn: (Qn41 : dn422°°* 2 Antm))|- 


Set 


g = [(An+13 An42 nee ee an+m)]|- 


Then by Theorem 1.13 we have 


2 + Pn-1 
E = [aosay s+ 10:8] = PST Pet 
Gn§ + 4n—1 
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We have already shown that E is a quadratic irrationality. Hence & is a quadratic 
irrationality as well. 


Secondly, we prove that every quadratic irrationality has a periodic continued 
fraction. Let € > 1 be a quadratic irrationality. By Lemma 12.12 there exists an 
SL(2,Z) matrix A with an eigenvector (1,6). By Theorem 11.9 the geometric con- 
tinued fraction of A has periodic LLS sequence. The LLS sequence for the angle a 
generated by the two vectors (1,0) and (1,6) is one-side infinite, and by Proposi- 
tion 4.13, from some element it coincides (up to a shift) with the LLS sequence for 
the matrix A. Hence, the LLS sequence for a@ is periodic from some point on. Hence 
from Corollary 3.7 and Theorem 3.8, the continued fraction for & is periodic. 

Suppose now that € < 1. The number & = €—|€]-+1 is quadratic and greater 
than 1, and hence it is periodic by the above. The continued fractions for € and E 
are distinct only in the first element. Hence the continued fraction for & is periodic 
as well. 


12.5 Exercises 


Exercise 12.1. Prove Proposition 12.2. 


Exercise 12.2. Prove that the two Dirichlet groups of hyperbolic matrices with ra- 
tional eigenvalues 


are not integer congruent. 


Exercise 12.3. Let A be a (2 x 2) matrix with distinct eigenvalues. Then every 
matrix commuting with A has the same eigenlines. 


Exercise 12.4. Do the following matrices have a cube root in GL(2, Z): 
é i") : be se) 5 
32 41)” 144 233} ° 
Exercise 12.5. Solve explicitly Pell’s equation 
m —dn* =1 


ford =2,3,5. 
Exercise 12.6. Find an SL(2,Z) matrix with eigenvector (1, 11). 


Check for 
updates | 


Chapter 13 
Gauss—Kuzmin Statistics 


It turns out that for almost all real numbers the frequency of a positive integer k in a 
continued fraction is equal to 


vs In (1 + TES) ; 

In2 k(k +2) 
i.e., for a general real x we have 42% of 1, 17% of 2, 9% of 3, etc. This distribution 
is traditionally called the Gauss—Kuzmin distribution. The statistics of the elements 
in continued fractions first appeared in the letters of K.F. Gauss to P.S. Laplace at 
the beginning of the nineteenth century (see [70], and also in [14]). The first proof 
with additional estimates was developed by R.O. Kuzmin in [132] in 1928 (see 
also [133]), and a little later, another proof with new estimates was given by P. Lévy 
in [142]. Further investigations in this directions were made by E. Wirsing in [224]. 


In this chapter we describe two strategies to study distributions of elements in 
continued fractions. A classical approach to the Gauss—Kuzmin distribution is 
based on the ergodicity of the Gauss map. The second approach is related to the ge- 
ometry of continued fractions and its projective invariance. It is interesting to note 
that the frequencies of elements has an unexpected interpretation in terms of cross- 
ratios (see Remark 13.31). Unfortunately, the classical approach does not have a 
generalization to the case of multidimensional sails, since it is not clear what map is 
the multidimensional analogue of the Gauss map. We avoid this problem by using a 
geometric approach to define and investigate multidimensional statistical questions 
for multidimensional sails. We describe the multidimensional case in Chapter 23. 


In the first five sections of this chapter we discuss the classical ergodic approach 
to the Gauss—Kuzmin distribution. In Section 13.1 we give some basic notions 
and definitions of ergodic theory. Further, in Sections 13.2 and 13.3, we present a 
measure related to continued fractions and the Gauss map. We prove the pointwise 
Gauss—Kuzmin theorem and formulate the original Gauss—Kuzmin theorem in 
Sections 13.4 and 13.5 respectively. 
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In the last five sections we study the statistic of edges of geometric one-dimen- 
sional continued fractions. After a brief discussion of cross-ratios (Section 13.6) 
we define a structure of a smooth manifold on the set of geometric continued frac- 
tions CF; in Section 13.7. Further, in Section 13.8, we define the Mobius measure 
on CF, which is invariant under the group PGL(R, 2) acting on CF; we write the 
Mobius form explicitly in Section 13.9. Finally, in Section 13.10, we define related 
frequencies of edges of continued fractions and show that they coincide with the 
Gauss—Kuzmin statistics of the elements of continued fractions. 


13.1 Some information from ergodic theory 


Let X be a set, Y a o-algebra on X, and wt a measure on the elements of XY. The 
collection (X,2,1) is called a measure space. If u(X) = 1, the measure space is 
called a probability measure space. 

Given a transformation T of a set X to itself, for every [-integrable function f 
on X one can define the time average for f at the point x to be 


n 


=A 
f(x) = lim i Y f(T*s). 
k=0 


n°) = 


The space average I, is 


p= wm | fae 


The space average always exists. The time average does not exist for all x. Neverthe- 
less, in the case in which we are interested, it exists for almost all x. We formulate 
the related theorem after one important definition. 


Definition 13.1. Let (X,2,,) be a measure space. A transformation T : X — X is 
measure-preserving if it is measurable and 


for every set A of X. 
For measure-preserving transformations we have the following theorem. 


Theorem 13.2. (Birkhoff’s Pointwise Ergodic Theorem.) Consider an arbitrary 
measure space (X,2,[L) and a measure preserving transformation T on X. Let f be 
a yl-integrable function on X. Then the time average converges almost everywhere 
to an invariant function f. 


Definition 13.3. Consider a probability measure space (X ,2,u). A measure-preser- 
ving transformation T on X is ergodic if for every X' € E satisfying T~!(X’) = X’, 
either U(X’) = 0 or n(X’) = 1. 
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Theorem 13.4. (Birkhoff—Khinchin’s Ergodic Theorem.) Consider a probabil- 
ity measure space (X,X,[) and a measure preserving transformation T. Suppose 
that T is ergodic. Then the values of the time average function are equal to the space 
average (i.e., f(x) =I) almost everywhere. 


13.2 The measure space related to continued fractions 


In this section we define a measure space that is closely related to distributions of 
the elements of continued fractions. For this measure we formulate a statement on 
the density of points for measurable subsets, which we use in the essential way in 
the proofs below. 


13.2.1 Definition of the measure space related to continued 
fractions 


Consider the measure space of the segment J = {x | 0 < x < 1} with the Borel o- 
algebra X and the measure fl defined on a measurable set S as 


2 1 dx 
~~ In2/ 14x" 
S 


f1(S) 


The coefficient | /In2 is taken such that the measure of the segment J equals 1. 


13.2.2 Theorems on density points of measurable subsets 


We start with a classical theorem on Lebesgue measure space. Denote by B(x, €) the 
standard ball of radius € centered at x. 


Theorem 13.5. (Lebesgue density.) Let A be the n-dimensional Lebesgue measure 
on R”. If A C R" is a Borel measurable set, then almost every point x € A is a 
Lebesgue density point: 

A(ANB(x,€)) 


lim ————__~ = 


e0 A(B(x,€)) 


Here “almost every point” means “except for a subset of zero measure”’. 


The measure fi is equivalent to the one-dimensional Lebesgue measure A on the 
segment [0, 1] (for more information on measure theory, see [148]). Hence we have 
a similar statement in the case of the measure space (X,2, /1). 
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01 ete. 1/3 1/2 


Fig. 13.1 The Gauss map. 


Corollary 13.6. ({i-density.) Let X = [0,1] and let fi be as above. If A CX isa 
fi-measurable set with positive measure [1(A), then almost every point in A satisfies 
f(ANB(x,e)) 


e) p(Ble)) 


13.3 On the Gauss map 


Let us introduce a transformation whose ergodic properties will form the basis for 
the proof of the Gauss—Kuzmin theorem. 


13.3.1 The Gauss map and corresponding invariant measure 


We consider the measure space (X ,2, fi) defined in the previous section. Define the 
Gauss map T of a segment [0, 1] to itself as follows: 


T(x) = {1/x}, 
where {r} denotes the fractional part r— |r| (see Fig. 13.7). 


Proposition 13.7. The Gauss map T is measure-preserving for the measure space 
(X,2, (1). 


We start with the following lemma. 
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Lemma 13.8. Let x = [0;a) : a2: ---]. Then 


7 1 

T(x) = {[0;k: a) 1a. :--JJkE Z4} = (saqlke 2+}. 
Proof. The first equality follows directly from the fact that 
T ([0;b1 : bz: -+-]) = [03b2 : b3...] 


and the fact that every real number has a unique regular continued fraction expansion 
with the last element not equal to 1. 
The second equality is straightforward. 


Proof of Proposition 13.7. Consider a measurable set S. From Lemma 13.8 it follows 
that 


pe me ie l+x ree / ee 


T~\(S) Ts) H 


Notice that on each open (i.e., without the boundary points) segment ]1/k,1/(k+1)[ 
the operator T is in one-to-one correspondence with the open segment ]0, 1|. Let us 
denote the inverse function to T on the segment |1/k,1/(k+1)[ by Ty) Therefore, 


r(r-1(s)n eval) =1(T;(S)) =5, 


and we can apply the rule of differentiation of a composite function. Recall that 
from Lemma 13.8 we know, that 


Then we have 


| dx dx a aT (*) [tee 


T-!(S) hy] 


“| mpthns (x+k)(x+k+1) 

Ss 
(the negative sign is taken, since the map TK mt X45 + changes the orientation). 
So we have 
1S dx 


BOO) ina | (x+k)(x+k4+1) 


Since the integrated functions are nonnegative, we can change the order of the sum- 
mation and the integration operations. We get 
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1 = 1 
)ar= ))a 
» Garett) (x-+k) aren) 7 ina | ( Dt ak xk) ) 


A(T” '(S)) = 


a1 ( 
In2 
Ss 
cs 
In2 
Ky 
So for every measurable set S we have 


A(T '(S)) = As). 


Therefore, the Gauss map T preserves the measure fl. 


Remark 13.9. (On the Euler—Mascheroni constant.) By definition, the Euler— 
Mascheroni constant (traditionally denoted by /) is the following infinite sum 


; “ 1 
y= fim (9 — Inn): 


k=1 


It was first studied by L. Euler in 1734. It is not known whether y is irrational. It 
turns out that the Euler—Mascheroni constant can be expressed as an integral of the 
Gauss map with respect to Lebesgue measure: 


y=1 — [ T(s)ae. 


0 


13.3.2 An example of an invariant set for the Gauss map 


Let us consider one example of a measurable set that is invariant under the Gauss 
map. 

Denote by ¥ the set of all irrational numbers in the segment [0, 1] whose contin- 
ued fractions contain only finitely many 1’s. It is clear that 


TP) =, 


since the operation T~! shifts elements of continued fractions by one and inserts the 
first element. 


Proposition 13.10. The set ¥ is measurable (i.e, ¥ € Z). 


Proof. Denote by Y, the set of all irrational numbers that contain the element 1 


exactly at place n. Notice that 
N= [1/2, 1], 


and therefore, it is measurable. Hence for every n the set 


ha =T"(%) 


13.3, On the Gauss map 145 


is measurable. 
Denote by Y% the set of all irrational numbers that do not contain an element ’1’. 
Since 


m=X\ Uh, 


n=1 


the set Y% is also measurable. Then 
T"(%) 


is measurable for any positive integer n. Hence 


co 


w= UT") 


n=1 


is measurable. 


We will prove later that the Gauss map is ergodic, and therefore, Y is either of 
zero measure or full measure in X. 


13.3.3 Ergodicity of the Gauss map 


In this subsection we prove the ergodicity of the Gauss map. 
Proposition 13.11. The Gauss map is ergodic. 


Before proving Proposition 13.11 we introduce some supplementary notation and 
prove two lemmas. 


For a sequence of positive integers (a1,...,a,) denote by T(ay,...,an) the segment 
with endpoints [0;a, : +--+ : dy—1 : dy] and [0;a) : +++: dy—1 2 dy + 1]. It is clear that 
the map 

hee Lay ..-an) fa (0, 1 


an)» and the inverse to T” is 


Ln as x [Osay 2 +++ tay: 1/2]. 
In terms of k-convergents py /qx = [O;ay : --- : ag], the expression for T/,' (x) is 


as follows (see Proposition 1.13): 


-1 = Pn[x+ Pn-1 — Pat Pn-1* 
(arte) ) = oT ye 
Gn/X + Qn-1 Gn T Qn-1X 


Lemma 13.12. The measure of a segment Iq, 


ity: 
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1 


f(T, < ; 
Paste) < 9G bg) Puan) 
Proof. We have 
[0;ay :++:an21] 
n (I = f 1 dx 
ECGs) 9 l+x In2 +x 
Kay yan) (Osa) :--:an] 
I ee Pno bn Pn 1 1 1 
wf BEY | a ) 
In2 1 aE Ge In2 (Gn+9n—1)(Pn+4n) 
1 


< . 
In2(Gn+4n—1)(Pnt4n) 


The last inequality follows from the concavity of the natural logarithm function. 


Lemma 13.13. For any invariant set S of positive measure and any interval 


Proof. Since the map T is surjective, we also have 


fg GY (mney a 


We have 
TPn-1% 
1 if ut dee 2) 
In2 a +x ~ Tn? 14 ore Pn+Pn-1x 
SU(ay,....an) nT n—1X 
_ = | dx 
In2 ‘ (Gn + Qn—-1X) (dn + qn-1X+ Pn + Pn—1X) 
1 dx 
2 
In2-Gn(n +4n-1+Pn + Pn—1) . 1+x 
1 
= f(S 
Gn(4n + 9n—1 + Pn rea ad ) 
1 In2 


= CREE OES lace, 2 AS) A ay,...an))+ 


The last inequality follows from Lemma 13.12. 


Proof of Proposition 13.11. Let S be a measurable subset of the unit interval such 
that T~'(S) =S. Suppose also fi(S) > 0 
For any irrational number y = [0;a) : a2 : ---], from Lemma 13.13 we have 
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AC(X\S)NBY Aas.) 4 In2 ACS) AU ay.....an)) m2, 
A(B(y, A ay,...,an)))) ~ 2 A(BYy,Aa,,...,an)))) 4 


Hence y is not a fi-density point of X \ S. Therefore, by Corollary 13.6 almost every 
point of [0, 1] \ Q is not in X \ S, and therefore, it is in S. Hence 


A(S) > A([0,1]\Q) =1 


Hence fi(S) = 1, concluding the proof of ergodicity of T. 


13.4 Pointwise Gauss—Kuzmin theorem 


Consider x in the segment [0, 1]. Let the regular continued fraction for x be [0;a1 : 
+++! py] (odd or infinite). For a positive integer k, set 


R #(k,n 
ja 
n 
where #(k,7) is the number of integer elements a; equal to k for i= 1,...,n. Define 


P,(x) = lim P, x(x). 


Nn—yoo 


Theorem 13.14. For every positive integer k and almost every x (i.e., in the com- 
plement of a set of zero measure) the following holds: 


Pix) = o a (1 e iD) 


We consider this theorem a pointwise Gauss—Kuzmin theorem. To prove this 
pointwise Gauss—Kuzmin theorem we use Birkhoff’s ergodic theorems. 


Proof. Consider a subset SCI. Let 7s be the characteristic function of S, i.e., 


ya fh ifxes, 
Us) = 0, otherwise. 
Then 
. yu 
aoe s 
Pax (x) = i Laut x) 


Hence, by Birkhoff’s pointwise ergodic theorem, the limit P,(x) exists almost ev- 
erywhere. Since the transformation T is ergodic, we apply the Birkhoff—Khinchin 
ergodic theorem and get 
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1/k 
Aw =| fhe of ease 1 
HP POP tnd Tt+x ind \ k(k+2))° 
0 1/(k+1) 


13.5 Original Gauss—Kuzmin theorem 


Let & be some irrational number between zero and one, and let [0;a1 : az : a3: --+] 
be its regular continued fraction. 

Let m,(x) denote the Lebesgue measure of the set of real numbers @ contained 
in the segment [0, 1] such that T”(a@) <x (here T is the Gauss map). In his letters to 
P.S. Laplace C.F. Gauss formulated without proofs the following theorem. 


Theorem 13.15. Gauss—Kuzmin. For 0 < x < 1 the following holds: 


In(1 
lim m,(x) = IEE) 2), 
nyo In2 


This theorem is technically complicated. For the proof we refer to the original 
manuscripts of R.O. Kuzmin [132] and [133] (see also A. Ya. Khinchin [116]). 

Denote by P,,(k), for an arbitrary integer k > 0, the measure of the set of all real 
numbers a of the segment [0,1] such that each of them has the number k at the 
nth position. The limit Jim P,(k) is called the frequency of k for regular continued 


fractions and is denoted by P(k). 
Corollary 13.16. For every positive integer k, the following holds: 


n= a a (1 | ae) 


Proof. Notice that P,(k) = mn(z) — mn (qq) Now the statement of the corollary 
follows from the Gauss—Kuzmin theorem. 


13.6 Cross-ratio in projective geometry 


In this section we switch to the multidimensional case for a while in order to give 
some definitions that are similar to the one-dimensional case (we will use these later, 
in Chapter 23). 
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13.6.1 Projective linear group 


The projective linear group (or the group of projective transformations) is the quo- 
tient group 
PGL(R,n) = GL(R,n)/Z(R,n), 


where Z(R,7) is the one-dimensional subgroup of all nonzero scalar transformations 
of R”. The group PGL(R,n) acts on the equivalence classes of vectors in R” with 
respect to Z(IR,n). We have 


R"/Z(R,n) =RP™|. 


Consider the affine part R’~! C RP"~!. The stabilizer for the affine part is exactly 
the group Aff(R,n—1). 


13.6.2 Cross-ratio, infinitesimal cross-ratio 


Consider an arbitrary line in R”~! with a Euclidean coordinate on it. 


Definition 13.17. Consider a 4-tuple of points on a line with coordinates z1, z2, z3, 
and z4. The value 

(z1 — z3)(Z2 — za) 

(29-33) (Zi 24) 


is called the cross-ratio of the 4-tuple. 


It is clear that the cross-ratio does not depend on the choice of the Euclidean 
coordinate on the line, and therefore, it a function on the space of ordered 4-tuples 
of distinct points in a line. 

As we have already noted above, the space R”~! can be considered an affine 
chart R”~! C RP”~!. Hence the action of PGL(IR,n) is well defined on the closure 
of R”~! (which is actually RP”~!). The projective transformations take planes to 
planes and, in particular, lines to lines. So it is natural to ask what happens to the 
cross-ratios of four points on a line. 


Proposition 13.18. The cross-ratio of four points is an invariant of projective trans- 
formations of R". 


We are interested in the infinitesimal cross-ratio, which is the following 2-form: 


dxdy 
(x—y) 


oe 
Notice that in the denominator we have 


(x—y)* = lim (x—(y+edy))-((x+e€dx)—y). 
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Corollary 13.19. The infinitesimal cross-ratio is an invariant of projective trans- 
formations of R"~!. 


Proof. The density of the infinitesimal cross-ratio coincides with the asymptotic 
coefficient at €* of the cross-ratios of 4-tuples of points: 


x, y, x+Eé€dx, ytedy, 


as € tends to 0. Therefore, the infinitesimal cross-ratio is a projective invariant. 


13.7 Smooth manifold of geometric continued fractions 


Denote the set of all geometric continued fractions by CF;. Consider an arbitrary el- 
ement of CF,. It is a continued fraction defined by an (unordered) pair of nonparallel 
lines (1, €2) passing through integer points. 

Denote the sets of all ordered collections of two independent and dependent 
straight lines by FCF, and A, respectively. We say that FCF, is a space of geo- 
metric framed continued fractions. We have 


FCF, = (RP! xRP')\A;=T*\A; 9 and = CF, = FCF, /(Z/2Z), 


where Z/2Z is the group transposing the lines in geometric continued fractions. 
Note that FCF, is a 2-fold covering of CF. We call the map of “forgetting” the 
order in the ordered collections the natural projection of the manifold FCF; to the 
manifold CF, and denote it by p (i.e., p: FCF; — CF)). 

Notice that FCF, is homeomorphic to the annulus and CF) is homeomorphic to 
the Mobius band. 


13.8 Mobius measure on the manifolds of continued fractions 


The group PGL(2,R) of transformations of RP! takes the set of all straight lines 
passing through the origin in the plane into itself. Hence, PGL(2,IR) naturally acts 
on CF, and FCF\. It is clear that the action of PGL(2,R) is transitive, i-e., it takes 
any (framed) continued fraction to any other. Notice that a stabilizer of any geomet- 
ric continued fraction is one-dimensional. 


Definition 13.20. A form on the manifold CF; (respectively FCF) is said to be a 
Mobius form if it is invariant under the action of PGL(2,R). 


Remark 13.21. The name for the invariant forms comes from theory of energies 
of knots and graphs in low dimensional topology, where these forms are used as 
densities for Mobius energies that are invariant under the group of Mobius transfor- 
mations in R? (we refer the interested reader to the book by J. O’ Hara [163]). 
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Proposition 13.22. All Mébius forms of the manifolds CF, and FCF, are propor- 
tional. 


Proof. Transitivity of the action of PGL(2,R) implies that all Mébius forms of the 
manifolds CF; and FCF; are proportional. 


Let @ be some volume form of the manifold M. Denote by [Lm a measure of the 
manifold M that for every open measurable set S contained in the same piecewise 
connected component of M is defined by the equality 


Definition 13.23. A measure u of the manifold CF, (FCF}) is said to be a Mébius 
measure if there exists a Mobius form @ of CF, (FCF{) such that LU = Uo. 


From Proposition 13.22 we have the following. 


Corollary 13.24. Any two Mobius measures are proportional. 


Remark 13.25. The projection p takes the Mobius measures of the manifold FCF, to 
the Mobius measures of the manifold CF;. This establishes an isomorphism between 
the spaces of Mobius measures for CF, and FCF. Since the manifold of framed 
continued fractions possesses simpler chart systems, all formulas of the work are 
given for the case of the framed continued fraction manifold. To calculate a mea- 
sure of some set F of the unframed continued fraction manifold, one should: take 
p_'!(F); calculate the Mébius measure of the obtained set of the manifold of framed 
continued fractions, and divide the result by 2. 


13.9 Explicit formulas for the Mobius form 


Let us write down Mobius forms of the framed one-dimensional continued fraction 
manifold FCF, explicitly in special charts. 

Consider a vector space R? equipped with standard metrics on it. Letting / be 
an arbitrary straight line in R* that does not pass through the origin, choose some 
Euclidean coordinates O;X; on it. Denote by FCF, a chart of the manifold FCF; 
that consists of all ordered pairs of straight lines both intersecting /. Let us associate 
to any point of FCF, ; (i.e. to a collection of two straight lines) coordinates (41,91); 
where x; and y; are the coordinates on / for the intersections of / with the first and 
the second straight lines of the collection respectively. Denote by |¥|; the Euclidean 
length of a vector V in the coordinates O/X/Y; of the chart FCF, ,. Note that the chart 
FCF, is the space R x R minus its diagonal. 

Consider the following form in the chart FCF, ;: 


dx; \dy, 


FEY 
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Proposition 13.26. The measure Uw, coincides with the restriction of some Mobius 
measures to FCF, j. 


Proof. Notice that the form @;(x;,y;) coincides with the infinitesimal cross-ratio on 
the line /. Hence it is invariant under projective transformations of / (on an every- 
where dense subset) in the chart FCF) ;. Therefore, the measure [, coincides with 
the restriction of some Mobius measures to FCF). 


Corollary 13.27. A restriction of an arbitrary Mobius measure to the chart FCF, 
is proportional to Le,. 


— 


Proof. The statement follows from the proportionality of any two Mobius measures. 


Consider now the manifold FCF; as a set of ordered pairs of distinct points on 
a circle R/7Z (this circle is a one-dimensional projective space obtained from the 
unit circle by identifying antipodal points). The doubled angular coordinate @ of the 
circle R/2Z induced by the coordinate x of the straight line R naturally defines the 
coordinates (@, 2) of the manifold FCF,. 


Proposition 13.28. The form @;(x;,,y;) is extendible to some form @, of FCF. In 
coordinates (1,2), the form @, can be written as follows: 


The proof of Proposition 13.28 is left as an exercise for the reader. 


13.10 Relative frequencies of edges of one-dimensional continued 
fractions 


Without loss of generality, in this section we consider only the Mobius form @, of 
Proposition 13.28. Denote the natural projection of the form U@, to the manifold of 
one-dimensional continued fractions CF; by [. 

Consider an arbitrary segment F with vertices at integer points. Denote by 
CF, (F) the set of continued fractions that contain the segment F as an edge. 


Definition 13.29. The quantity 11;(CF,(F)) is called the relative frequency of the 
edge F. 


Note that the relative frequencies of edges of the same integer-linear type are 
equivalent. Every edge of a one-dimensional continued fraction is at unit integer 
distance from the origin. Thus, the integer-linear type of a segment is defined by its 
integer length (the number of inner integer points plus unity). Denote the relative 
frequency of the edge of integer length k by p1)("k"). 
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Fig. 13.2 Rays defining a continued fraction should lie in the domain shaded in gray. 


Proposition 13.30. For every positive integer k, the following holds: 


pi('") =n (14 ee). 


Proof. Consider a particular representative of an integer-linear type of a length-k 
segment: the segment with vertices (0, 1) and (k, 1). The one-dimensional continued 
fraction contains the segment as an edge if and only if one of the straight lines 
defining the fraction intersects the interval with vertices (—1,1) and (0,1) while 
the other straight line intersects the interval with vertices (k,1) and (k+1,1) (see 
Fig. 13.2). 

For the straight line / defined by the equation y = 1, we calculate the Mébius 
measure of the Cartesian product of the described pair of intervals. By the last sec- 
tion it follows that this quantity coincides with the relative frequency p1)("k"). So, 


0 k+1 


k+1 
1 1 
uy (UK) Leto = PLGA an 
24 yy ytl 


(k+1)(k+1) 1 
in| dl 
Rk +2) k(k-+2) 


This proves the proposition. 


Remark 13.31. Note that the argument of the logarithm aj At is the cross-ratio 


of points (—1,1), (0,1), (k, 1), and (k+1, 1). 
Corollary 13.32. The relative frequency Lt ("k"), up to the factor 


coincides with the Gauss—Kuzmin frequency P(k). 
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13.11 Exercises 


Exercise 13.1. (a) Prove that the measure 


is a probability measure on the segment (0, 1], i-e., u([0, 1]) = 1. 
(b) Find u([a,b]) for0 <a <b <1, where p is as above. 


Exercise 13.2. Ergodicity of the doubling map. Consider the space (S!,Z,/), 
where X is the unit circle, Y is the Borel o-algebra, and A is the Lebesgue measure. 
Consider the doubling map T : S' —> S! such that 


Prove that T is measure-preserving and ergodic. 


Exercise 13.3. Define the frequencies of subsequences in continued fractions. What 
is the frequency of the sequence (1,2,3)? 


Exercise 13.4. Prove the fi-density theorem from the Lebesgue density theorem. 


Exercise 13.5. Recall that Y% is the subset irrational numbers in [0, 1] whose contin- 
ued fractions do not contain | as an element. Prove by elementary means (without 
using ergodic theorems) that 

fi(%) = 0. 
Exercise 13.6. Prove the projective invariance of the cross-ratio. 


Exercise 13.7. Prove that any two triples of points on a line are projectively equiva- 
lent. Find a criterion for two 4-tuples of points on a line to be projectively equivalent. 


Exercise 13.8. Prove that 

(a) RP! is homeomorphic to a circle; 

(b) FCF; is homeomorphic to an annulus; 

(c) CF, is homeomorphic to the Mobius band. 


Exercise 13.9. Prove Proposition 13.28. 


| ® 
Check for 
updates 


Chapter 14 
Geometric Aspects of Approximation 


The approximation properties of continued fractions have attracted researchers for 
centuries. There are many different directions of investigation in this important sub- 
ject (the study of best approximations, badly approximable numbers, etc.). In this 
chapter we consider two geometric questions of approximations by continued frac- 
tions. In Section 14.1 we prove two classical results on best approximations of real 
numbers by rational numbers. Further, in Section 14.2, we describe a rather new 
branch of generalized Diophantine approximations concerning arrangements of two 
lines in the plane passing through the origin. In this chapter we use some mate- 
rial related to basic properties of continued fractions of Chapter 1, to geometry of 
numbers of Chapter 3, and to Markov—Davenport forms of Chapter 11. 


14.1 Two types of best approximations of rational numbers 


In this section we study the problem of best approximations of real numbers by ra- 
tional numbers. We prove here a classical result that all best approximations p/q 
are represented by the points (p,q) of the corresponding sails. Conversely, the point 
(p,q) of the sails is a best approximation if the denominator q satisfies the so-called 
half-rule. Further, we discuss strong best approximations, which are exactly conver- 
gents of the corresponding continued fractions. 


14.1.1 Best Diophantine approximations 


We say that a rational number p/q (where q > 0) is a best approximation of a real 
number a if for any other fraction p’/q' with 0 < q' <q we get 


/ 


a2) >|a—2] 
q q 
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Remark 14.1. By the definition of best approximations, every integer is a best ap- 
proximation of every real number @. To be more consistent we should say that the 
best approximations with integer denominator are | a| and |a| +1. 

Remark 14.2. From the definition of best approximations it is not immediate that 
there are no two best approximations with the same denominator (greater than 1). 
This is a direct corollary of Theorem 14.3 below. 


Theorem 14.3. Let a be a positive real number with regular continued frac- 
tion [aj;a1 : ---]. Denote by px/qx its k-convergent and by rx the remainder part 
(ag; p41 2 +++]. A rational number p/q (with q > 1) is a best approximation of a if 
and only if there exist k > 0 and an integer x such that 


P/q = |aora1 : +++: aK: x], 


where x satisfies either 


ak+1 
| = | <X <4] 
or alternatively x = ax4,/2 in the specific case in which ag, is divisible by 2 and 
Q satisfies the inequality 


Gk+1 
Tk-1 < 
qk 


(which is sometimes called the half-rule condition). 


Remark 14.4. The condition 


k+1 
Th < 
qk 
is equivalent to the condition 
[an4134n422°°*) < [aegisag: +++: ao). 


Before proving this theorem, we need to prove several preparatory statements. 


Proposition 14.5. Consider an arbitrary nonzero real number a. Then all its con- 
vergents are best approximations for Q. 


Proof. Let px/qxz be the k-convergent to a (we assume that the integers px, and gx 
are relatively prime). We prove the proposition by induction on k. 


Base of induction. If the denominator g;, equals 1, then we have either | a| for k =0 
or sometimes |a@| +1 for k = 1 when a; = 1. As we have agreed in Remark 14.1, 
these two numbers are best approximations. 

Step of induction. Suppose that the (k—1)-convergent p,_1/qx_1 is a best approxi- 
mation. Let us prove that the k-convergent p;/q, is a best approximation (we sup- 
pose also that q, > 1). Set 


A= (pe-1,94x-1) and B= (px, gx). 


From Theorem 3.1 it follows that A and B are vertices of two different sails for two 
angles defined by the ray y = ax together with two positive coordinate rays. 
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ry y= ax 


Fig. 14.1 There is no integer points in the strip between the lines /; and lp. 


Denote by /; the line passing through the points O and A, and by / the line 
passing through the points B and parallel to /;. Draw the line through B parallel to 
the line x = O and denote the intersection of this line with the ray OA by C (see 
Fig. 14.1). 

From Proposition 1.15 it follows that the triangle AAOB is empty. Hence, there 
are no integer points in the strip between the lines /; and /2 (which is gray in 
Fig. 14.1). Thus every integer point of the triangle ABOC is either in the line OC or 
coincides with B. Therefore, every integer point (p,q) with qx_1 < q < qx satisfies 

2a ina a) = |e 

q qk-1 qk qk 
(the last equality follows from Proposition 1.21). By induction, pz_1/qx—1 is a best 
approximation. Hence if g < gx_1, then 


S Pk-1 
Fk-1 


s, | Pk 


a Age 
q qk 


(the second inequality follows from Proposition 1.21). Therefore, the k-convergent 
Px/qx is a best approximation. The induction step is complete. 


Lemma 14.6. Consider a nonzero real number a > 0 with regular continued frac- 
tion |ag;a : +++]. Let pi/q; be its i-convergent for i=0,1,2,.... Then for everyk > 1 
every best approximation p/q of & satisfying 


Uk-1 <9 54 


lies on the line passing through the point (px,qx) and parallel to the vector 
(Pe—-1;9k-1) (it is the line ly in Fig. 14.1). 
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Proof. Let A = (pr_1,q9x-1) and B = (px,q,). Suppose that the lines /; and /y are as 
above (see Fig. 14.1). 

Consider E(p,q). If ld(E,OA) > 1, then the triangle AEOA contains at least one 
integer point E’(p’,g’) incident with AEOA distinct from E and not contained in 
AO. First, it is clear that g' < q, since E has the greatest second coordinate among 
all the points of the triangle AOAE. Secondly, E and E’ are in the same half-plane 
with respect to the line y = ax. Hence, 


?—al> 
q 


Therefore, p/q is not a best approximation of a. 

Suppose now that the point FE is in the same half-plane with the point A with 
respect to the line y = ax. Then the angle between OA and y = @x is not greater 
than the angle between OE and y = ax. Therefore, E is not a best approximation. 


So if E is a best approximation satisfying the conditions of the lemma, then 
Id(E,OA) = 1 and E is in the same half-plane with the point B. Therefore, E is on 
the line /o. 


Corollary 14.7. All best approximations of & = |ag;a, : --+| satisfying 
Gk <9 5 4k+1 
are of the form [ao;-++ : ag: m|, where m is a positive integer not greater than ag. 


Proof. From Lemma 14.6 we know that all best approximations satisfying the con- 
dition of the corollary are on a line /2. All integer points of this line are of the form 
B+m0OA for an integer m, or in other words, 


(Pek) + M(PK-159k-1): 
From Proposition 1.13 it follows that 


Pk T™Pk-1 
dk + Mqk-1 


= [ao3-++ : ag: mj. 


For m < 0 the point B + mOA is not in the first octant. For m > a; the second coor- 
dinate of the point B+ mOA is greater than gx. Therefore, we have 0 < m < ag. 


Proof of Theorem 14.3. By Corollary 14.7 we know that all best approximations are 
of the form 


[ao3-++ ag: x], 
where 0 < x < ag41. By Proposition 14.5 we get that all the convergents (i.e., the 
case X = dax41) are best approximations. It is easy to see that 


[ao3-++ 1g: x] > [dos-++ tay xt 1] 
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for x = 1,2,...,a,—2. Therefore, if [ao;--- : ag : x] is a best approximation, then 
[ao3-++ : a, : x + 1] is also a best approximation. 

Now it remains only to find the smallest number x in the set {1,2,...,a,—2} 
such that the number [ao;--- : ag : x] is a best approximation. By Proposition 14.5 
we know that the previous best approximation is exactly p,/qx. So we should check 
when the following inequality holds: 


|[ao3+++: az :x]-—@t| = |[aos+++ 2 a1 : ax] — Gt}. (14.1) 
Recall that 
GVA jo 
[ag; +++: ax] = —; 
qk es 
dg Sagrate ee ees, 
kx T Gk-1 
Therefore, 
|[a0;-++ a1 :a;J—a| = [Pedk—1 — UkPk-1| 1 
, G(r + Ok-1) (Gere + 9-1) 

and 
|[a0;-+-:a,:x]-a| = [PkIk—1 = WkPk-1N(Tk+1 =) _ ae 


(qx +n—1) (Gare +91) (RX + -1) (Gerke + 4k-1) 


Here |pxqgx—1 — 9kPx-1| = 1 holds by Proposition 1.15. Notice that we do not con- 
sider the case x > rz,1, since then the denominator of the corresponding fraction is 
greater than g,,,. Hence inequality (14.1) is equivalent to the inequality 


Tktl— 1 
k+l —*X < 
(GeX+4K-1) — Ok 


which is equivalent to the following one: 


Qk-1 
Tk+1— “g 


dk 
x> 
2 


Suppose that a; , is odd. Then 


Es “ an er s Ea a 
2 2 2 


Therefore, for an integer x, inequality (14.1) is equivalent to 


ak+1 
> 
2 


Now suppose that az, 1 is even. Then 
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dk 
E Tit — “Ge | 
2 2 2 


[+1 


Therefore, for an integer x, inequality (14.1) is satisfied for 


Ak+1 
> 
2 


Finally, it remains to study the case of x = az, /2. We should check whether the 
following inequality holds: 


qk-1 
Tk+1— “gq 
Ik 
a4 /2 > 5) 
It is equivalent to 
qk-1 
Ak+1 > Tkp-1- ——, 
qk 
and hence 
Ak+1dk — Fk-1 
Se EET 


qk 
According to Proposition 1.13, the numerator of the left part equals to g,.1. So for 
X = ax41/2, inequality (14.1) is equivalent to 
jee ThA 
qk 
We have described all solutions of inequality (14.1). This concludes the proof of 
Theorem 14.3. 


14.1.2 Strong best Diophantine approximations 


We say that a rational number p/q (where g > 0) is a strong best approximation of 
areal number a if for every fraction p’/q' with 0 <q <q’ we get 


|q'a — p'| > |qa— pl. 


In this subsection we show that all strong best approximations are convergents of 
the corresponding number. In particular, every strong best approximation is a best 
approximation. 


Proposition 14.8. Every strong best approximation of a real number @ is a best 
approximation of Q. 


Proof. Consider a strong best approximation p/q of a. Then for every rational num- 
ber p’/q' with0 <q <q’ we get 
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! , 
lga— p'| > |qa— pl. 


Since 0 < q' <q, we get 


15 | 
7|7a—p'| > —|qa— pl, 
q q 


which is equivalent to 


Hence p/q is a best approximation. 


Theorem 14.9. The set of all strong best approximations of a real number Q coin- 
cides with the set of all convergents of Q. 


Remark 14.10. In the case of rational o& = [ag;a1 : --- : dn] (where a, 4 1) the num- 
ber p'/q' = [ao;a1 : +--+: dn—1] is not a strong best approximation of a. In this case 
there is a unique number pp—1/Gn—1 = [G0341 : ++: : Qn—1] With smaller denominator 


that is as good as p’/q’: 
|’ — p"| = |an—10 — pnt]. 


Proof. Let p;/q; denote the i-convergent of @ = [ao;a) : az: ---]. 

From Proposition 14.8 it follows that every strong best approximation is a best 
approximation. By Theorem 14.3 every best approximation is on a sail of the cor- 
responding angles. Therefore, by Theorem 3.1 and Proposition 1.13, every best ap- 
proximation can be written for some k > 1 as 


hpe+ Pr-1 


; (14.2) 
hak +k-1 


where h varies from 0 to ay,,. For h = 0 and h = az, we get the k-convergent and 
the (k+1)-convergent respectively. If 0 < h < ay41, then on the one hand, 


|(hqx + G@x—1) — (hp + Pr-1)| 

= |((A+1)qe+e-1)@— ((h +1) pe + Pre-1) — (0 — Pr) 
(a+ 1) gx + 4x1) — ((h+ 1) p+ Pe-1)| +1(Ge% — Pr)| 
> |qea— pel; 


but on the other hand, hq; + qx_1 > qx. Hence for 0 < h < ag, , the rational number 
of expression (14.2) is not a strong best approximation. Therefore, every strong best 
approximation of & is a convergent of a. 


For the case of convergents we have 


14k+1% — peril = |(Ge+1 + 9%) O — (Peri + Pr) — (4xO— px)| 
< — |qe — pr| — \(Ge+1 + 9x) — (Peri + PK)| < |qxo — pel. 
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The first inequality holds since the integer points (pgi1,q¢x41) and (peri t+ Pe. degit 
4x) are in different half-planes with respect to the line y = ax. Therefore, all the con- 
vergents are strong best approximations. 


Example 14.11. Consider the example of a = aa = [2;4: 5]. All best approxima- 
tions and strong best approximations are shown in the following table. 


Approximations || [2]}[2;3]|[2;4]}[2;4:3] |[2;4:4] [2;4:5]] 
Best +) + | + + + ee 
Strong Best |/ +] - | + - E + | 


14.2 Rational approximations of arrangements of two lines 


In the previous section we studied the classical problem of an approximation of real 
numbers by rational numbers. In geometric language this means that we approx- 
imate some real ray with vertex at the origin by integer rays with vertices at the 
origin. Now we say a few words about the approximation theory of arrangements of 
two lines intersecting at the origin. On the one hand, the situation here reminds us 
of the situation of simultaneous approximation, while on the other hand, algebraic 
aspects of arrangement approximation are not visible for simultaneous approxima- 
tion. We briefly touch on the multidimensional case in Chapter 26. (For extra details 
we refer to [109].) 


In Section 14.2.1 we associate to any arrangement of two lines a Markov— 
Davenport form. (Actually, we have already seen these formlys while studying the 
Markov spectrum in Chapter 7.) Further, in Section 14.2.2, we say a few words about 
a natural class of integer approximations for arrangements and introduce the notion 
of sizes for integer approximations. We define the distance function (discrepancy) 
between two arrangements in Section 14.2.3. 

In Section 14.2.4 we prove a general estimate for the discrepancy of best approx- 
imations for the arrangements defined by pairs of continued fractions with bounded 
elements: it turns out that in this case, the discrepancy is proportional to N~*, where 
N is the size of the corresponding best approximation. Further, we give an example 
of an arrangement one of whose continued fractions has a rapid growing sequence 
of elements. In this case the discrepancy is asymptotically not better than N~!~©. 
We study best approximations of arrangements whose lines are eigenlines of certain 
matrices in SL(2,Z) in Section 14.2.5. Finally, in Section 14.2.6 we show how to 
simplify the calculation of best approximations in practice. 
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14.2.1 Regular angles and related Markov—Davenport forms 


In this section we study arrangements of two distinct lines passing through the ori- 


gin. 
We start with the notion of Markov—Davenport forms. 


Definition 14.12. Consider an arbitrary quadratic form 
f(x,y) = ax" + bxy + ey" 


with real coefficients and positive discriminant A(f) = b* — 4ac. Then the form 


is called the Markov—Davenport form. 


Consider an arbitrary arrangement R of lines /; and /2. Let the linear forms a,x + 
biy and a2x + b2y attain zero values at all vectors of J; and /2 respectively. There 
is a unique, up to a sign, Markov—Davenport form associated to R whose zero set 
coincides with the union J; Ul. It is written as 


(aix + by) (aox + boy) 


ajbz — ab, 
We say that this form is associated to the arrangement R and denote it by ®p. 


Example 14.13. Let us write the Markov—Davenport form for the arrangement of 
eigenlines of the Fibonacci operator 

11 

10)" 


The Fibonacci operator has two eigenlines, 
y=-—Ox and y= 0-'x, 


where @ is the golden ratio, i.e., 0 = Eee The Markov—Davenport form of the 
Fibonacci operator is 


x)(y—-O7lx 
wrong = Fe xy—-y’). 
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14.2.2 Integer arrangements and their sizes 


To construct an approximation theory for arrangements one should choose the set 
of all approximations and define the quality of each of the approximations of this 
subset. We consider the set of all integer arrangements as the set of approximations: 
an arrangement of two lines passing through the origin is called integer if both its 
lines contain integer points distinct from the origin. Let us define the quality of the 
approximation (which we call the size). 


For a vector v = (a,b) denote by |v| the norm max(|a|, |b|). 


Definition 14.14. Consider an integer arrangement R of lines /; and /) passing 
through the origin. Let v; and v2 be two integer vectors of integer length one con- 
tained in /; and /p respectively. The size of R is the real number 


max(|vj|,|v2]), 


which we denote by v(R). 


Remark 14.15. For a generic irrational arrangement the best approximation of any 
size greater than | is unique. Nevertheless, for particular cases there could be more 
than one best approximation of the same size. 


14.2.3 Discrepancy functional and approximation model 


Let us now define a natural distance between two arrangements in the plane passing 
through the origin. 


Definition 14.16. Let R), Ro be two arrangements of pairs of lines in the plane. 
Consider two symmetric bilinear forms 


Pr, (v)+@p,(v) and pr, (v) — Pp, (v). 


Take the maximal absolute values of the coefficients of these forms. Suppose that 
they equal cy and c. The discrepancy between R, and R> is min(c;,c2). We denote 
it by p(R1,R2). 


Remark 14.17. In case of arrangements R; and R2 of ordered lines one should con- 
sider the maximal absolute values of the coefficients only of one form 


Pr, (v) — Pr, (v). 


The problem of best approximations of an arbitrary arrangement by integer ar- 
rangements is the following. 


Problem of best approximations of arrangements. For a given arrangement R 
and an integer N > 0, find an integer arrangement Ry such that vV(Rv) < N and 
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p(R,Rv) = min {p(R,R’) | R is integer, v(R’) < N}. 


Remark 14.18. The space of all arrangements of two lines passing through the ori- 
gin is two-dimensional. Nevertheless, the complexity of approximation by integer 
arrangements is more complicated in comparison with simultaneous approximation 
of a line in R? (whose configuration space is also two-dimensional). This happens 
becouse for the approximation of the arrangement of two lines one needs two pairs 
of relatively prime integers, while for simultaneous approximation of a vector in R?, 
a triple of relatively prime integers suffices. Nevertheless, geometric properties of 
continued fractions essentially simplify the approximations of arrangements. This is 
relevant especially for so-called algebraic arrangements, whose lines are eigenlines 
of some SL(2,Z) matrix. In this case all the approximations are constructed from 
some periodic sets that are easy to construct. 


14.2.4 Lagrange estimates for the case of continued fractions with 
bounded elements 


In this subsection we prove an analogue of Lagrange’s theorem on the approxima- 
tion rate of arrangements of two lines y = a x and y = Q)x whose coefficients 
and QQ have continued fractions with bounded elements. In particular, this includes 
all algebraic arrangements (as in Remark 14.18). For the convenience of the reader 
we do not consider the degenerate case in which one of the lines is x = 0, which is 
not so important, since it is equivalent to the approximation of a vector in the plane. 


Theorem 14.19. Consider an arrangement R of the lines y = Ox and y = Qpx. 
Let Q and Qy be real numbers having infinite continued fractions with bounded 
elements. Then there exist constants C,,Cz > 0 such that for every integer N > 0, 
the best approximation Ry satisfies 


C2 


G 
+ <p(R,Ry) < oe 


nN? 


This theorem is a reformulation of one of the results of [109]. We begin the proof 
with the following two technical lemmas. 
Let Rg, 5, denote the arrangement of lines y = (a + 6;)x for i= 1,2. 


Lemma 14.20. Consider a real number € satisfying 
0<€<1/|a);— a]. 
Supposing that p(R,R5,5,) < €, then 


(1 + |@2|) (or — a)? 


(1+ |on|)(a1 — o%)* 
|ou|(1 — €]04 — am|) 


b| < 
ul < ye 1d —elen — on) 


and |d2| < 
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Proof. The Markov—Davenport form of R¢5, 5, is 


_ (y= (a1 +61)x) (y — (a2 + &)x) 
i530) — et) (+o) 


Then the form ®r — Pps & is as follows: 


ox 52 ~ 03 61 + 01.518 — 026152 
(O, — On) (A — A +6, — bp) 
ct 0251 — 061 02 5 
(Q, — Oy) (Oy — A + 6 — bp) 
o> — 6) . 
" ( — O) (04 — O + 8 — d2) . 


(14.3) 


Consider the absolute values of the coefficients at y* and at xy for the form ®g — 
Br 51,85" By the conditions of the lemma these coefficients are less then €. We have 


Oo — 6} 
eee =e (14.4) 
02.51 — 0, dp 
Perr eee <E. (14.5) 


Inequality (14.4) implies 


(@1 — a)? 
1—e]a, —a| 


|; — d9| < 


From Inequality (14.5) we have 
|(O) — Qn) (Oy +6, O)|€ |o01 (01 &2)| 


ron < 
os ia | 
and hence we have 
(a%—a)" (@% 0)" 
154) < | — | (1 — | + Tee af EVE +101 ea, ao] E 
| | 


(1+ |on|)(@1 ~ o%)° 
|Q2|(1 — €|a% — |) 


The inequality for 6) is obtained in a similar way. 


Lemma 14.21. Let € be a positive real number. Suppose that real numbers 6, and 
&p satisfy |b;| < € and |dz| < €. Then the following estimate holds: 
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max (2,2(\a]+ |or2|), 0? +05 + |or — ane) 


R,R < 
P(R.R5,,5)) (jor1 — a|) (Jey — | +2€) 


Proof. The statement of the lemma follows directly from the estimate for R— Rs, 5,. 
shown in (14.3). For the coefficient at x” we have 
07 &y — 05.5; + 015, dy — 5 5 0? +05 + |r) —Ob|€ P 
(G1 — OQ) (OQ) — A + 6 — dy) (|O1 — Q2|)(|O1 — Q|+2e) 


For the coefficient at xy we have 


02.61 — 061 02 Be 2(|Q1| + |O2]) Z 
(1 — O)(Q — OA + 6, — by) ({o%) — |) (|, — O|+2€) ~ 


For the coefficient at y? we have 


Oo — Oj y< 2 
(GQ — O2)(Q)1 — Oy + 6, — dy) (|O1 — Q|) (| — O2| +2e 


yo 


Therefore, the statement of the lemma holds by the definition of discrepancy. 


Proof of Theorem 14.19. Let us start with the proof of the inequality 


or 


N2 < p(R,Ry). 


Let o = [ao;a1 : ---] and p;/q; = [ao;a1 : --- : aj] fori = 0,1,... . Without loss of 

generality we assume that N > ag. Assume that k is the greatest integer for which 
De <N and gq, < N. Then we have the following estimates: 

min ( aba 

k+1 


> 1 
gk+1 (Ge+i + 4k+42) 


Qa 


a 2 Inlew. ils) > 


> 
~ (aii +1) ge ( (aie + 4k + (dey. + y (ax42 + 1) qx) 


= 
~ (4e¢1 +1)? (ae42 +2) N? 


The second inequality follows from Exercise 1.3 on page 20 (we leave the proof for 
the reader). For the third inequality we use the following inequality twice: 


ditt = 419i +qi-1 < (a +1)qi, (14.6) 


which follows directly from Proposition 1.13. 
The same calculations are valid for a2. After that, the constant C) is obtained by 
Lemma 14.20. 


Now we prove the inequality 
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C2 
p(R, Ry) > N2° 


Again we assume that & is the greatest integer for which |p;| < N and q, < N. 
For a we have the following: 


Pk 
qk 


Pk+1 Pk 
GUk+1 =k 


tC aed. en $1) (leu +1)? 
OkIK+I Geis N? 


Q ae 


The first inequality follows from the fact that if the k-convergent is greater than a, 
then the (k+1)-convergent is less then @, and conversely. Proposition 1.15 implies 
the second equality. The third inequality follows from inequality (14.6). In the last 
inequality we use the assumption that either gz; > N or |px14| > N, in which case 


Jay|+1~ joy|+1 


(here the first inequality holds since |@ — px+1/qx+1| < 1). From conditions of the 
theorem the set of all elements a; of the continued fraction for @ is bounded. There- 
fore, there exists a constant Ch, such that for every N there exists an approximation 
of a of magnitude smaller than Ch /N*. 

A similar estimate holds for 0). There exists a constant Cho such that for every N 
there exists an approximation of 0 of magnitude smaller than C} , /N >. Therefore, 
we can apply Lemma 14.21 in order to obtain the constant C>. 


Remark 14.22. Now we say a few words about the case of unbounded elements 

of continued fractions. Let € be a small positive real number. If the elements of a 

continued fraction (say for 1) grow fast enough, then there exists a sequence N; for 
C 


which the approximations Ry, are of magnitude Wr: Let us illustrate this by the 


following example. 


Example 14.23. Let M be an arbitrary positive integer. Consider an arrangement 


R of the lines y = 0 and y = ax, where the continued fraction [ag;a) : ---] for @ is 
defined inductively: 

—ay = 1; 

— let ag,...,ax be defined and [ag; +++ : ax] = ao then ay, = (qx)!—1. 
Let Ni, = |“¢+| +1 for k = 1,2,... . Then there exists a positive constant C such 


that for every integer 7 we have 


Cc 
p(R,Rn,) 2 yiti/M 


Proof. For every i we have 
qin > igi =a qi =i". 


Therefore, the magnitude of best approximations for k-convergents are as follows: 
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1 1 1 1 
> > = 


Pk 

< : 

= = 1/M 1+1/M 1+1/M 
qk(4k+1 + 4k) Wn (24x41) es Deine 


qk 


Qt) 


From Theorem 14.3 it follows that there is no best approximation whose denomina- 
tors q satisfies 
Uk <9 SNe. 
Therefore, Ry, = R > . Hence 
4k 
Cc 


1+1/M° 


P(R,Rv,) = P(R,R% ) = 
qk N, 


The last inequality follows from Lemma 14.20. This completes the proof. 


We suspect the existence of a badly approximable arrangement R and a constant 
C such that there are only finitely many solutions N of the inequality 


C 
p(R, Rn) S N’ 


as in the case of simultaneous approximation of vectors in IR3 (see, e.g., in [138]). 


14.2.5 Periodic sails and best approximations in the algebraic case 


14.2.5.1 Geometric w-continued fractions 


Let us prove a relation between classical geometry of numbers and best simultane- 
ous approximations. 


Definition 14.24. Define inductively the w-sails for an arbitrary angle C with vertex 
at the origin. 

— let the 1-sail be the sail of C. 

— suppose that all w-sails for w < wo are defined. Then let the wo-sail be 


wo-l 


re) (conv ((cnz"\{o}) \ U w-sail) ) ; 


where conv(M) denotes the convex hull of M. 

Consider an arbitrary arrangement R of two lines /; and /2. The union of all four 
w-sails for the cones defined by the lines /; and /2 is the geometric continued w- 
fraction of R. 


In Fig. 14.2 we show the geometric continued 1-, 2-, and 3-fractions for a partic- 
ular arrangement of two lines. It is interesting to notice that the geometric continued 
3-fraction in the figure is homothetic to the geometric continued 2-fraction and to 
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Fig. 14.2 The arrangement of two lines and its geometric continued 1-, 2-, and 3-fractions. 


the geometric continued fraction (which is a |-fraction). It turns out that it is a gen- 
eral fact that the w-continued fraction of any general arrangement R (i.e., whose 
lines do not contain integer points except the origin) is homothetic to the geometric 
continued fraction of R; the coefficient of homothety is w. We leave this statement 
as an exercise for the reader. 


14.2.5.2 Algebraic arrangements 


An arrangement of two lines /;,/2 is said to be algebraic if there exists an operator 
A € GL(2,Z) with irreducible, over Q, characteristic polynomial having two real 
roots whose eigenlines are /; and /2. In this case the operator A acts on a w-geometric 
continued fraction (for any w) as a transitive shift. 


Definition 14.25. The minimal absolute value of the form ®g on the integer lattice 
minus the origin is called the Markov minimum of ®r. We denote it by ap. 


Recall that Markov minima are studied in the Markov spectrum theory discussed 
in Chapter 7. 


Lemma 14.26. Let v be an integer point in the w-continued fraction of an algebraic 
arrangement R. Then 
|De(v)| > wae. 


Proof. The statement follows directly from the fact that the w-continued fraction of 
R is homothetic to the continued fraction of R. 


Theorem 14.27. Let R be an algebraic arrangement. Then there exists C > 0 such 
that for every N € Z4. the following holds: each of the two lines of the best approx- 
imation arrangement Ry contains an integer point of some w-sail for w <C. 
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Proof. Let R be an arrangement of lines y = Qx and y = dx. Let also 


Vin(XinsyYin) and vo n(x2,y,y2,y) 


be two integer vectors of unit integer length in the lines defining Ry. 
Since R is algebraic, the continued fractions for @& and 2 are periodic and there- 
fore uniformly bounded. Now we can apply Theorem 14.19 to write the estimate 


Hence by Lemma 14.20 we have 


Notice that 


(x1 = O1y1w) (X1,N tz O2y1.N) 
QA, — A 


|Pr(x1N, Yi) | = 


= j=“ 6 | “a 


YIN Qa — A 


The first term above is bounded by C/N? for some constant C that does not depend 
on N. Hence, 


y? XIN ~ 1.N —~ 
=|71,N Yin | YIN 
IPa(xiwyiw)| SC]: <C|= 

N A, — A A, — A 


Finally, the last expression is uniformly bounded in NV. The same property holds for 
the values | ®p(v2,~”)|. 

Therefore, by Lemma 14.26 the vectors of vj and v2,n for all N are in the union 
of a finite number of w-continued fractions for R. 


Conjecture 7. We conjecture that for all algebraic arrangements for almost all NV, 
the vectors vy and v2 y defining Ry are in a 1-geometric continued fraction. 


14.2.6 Finding best approximations of line arrangements 


In this subsection we show a general technique for calculating best approximations 
for an arbitrary arrangement of lines R with eigenspaces y = x and y = Qx for 
distinct real numbers a and Q). 


Proposition 14.28. Let m and n be two integers. Supposing that |a — Al <6 (or 
|Ot2 — All < 6, respectively), then the following holds: 
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“ 2 | —O2| | Pr(p,q)| 
q |O, — O|+6 i 
( Pp |o, — O| aga) 
(04) 5) 
q |Ot) — Q|+6 q 


_ 1 |p=aq|(p = 02g) 


P— O2q 
|r(p,q)| |O4, — | | —0| | Pr(p,q)| 
gq | —O&%+(2—m%)|~ |a—m|+d  &? 


The same holds for the case of the approximations of a. 


Procedure of best approximation calculation. 
Input data. We start with an arrangement R of two lines y = a)x and y = Qx anda 
positive integer NV. The task is to construct the best approximation Ry. 


Step 1. Find p;/q;, which is the i-convergent to a with maximal possible denomi- 
nator satisfying 
1<pj<N and 1<q<QN. 


Also find P; / qj. the convergent satisfying the same property for Qo. 


Step 2. Consider the arrangement R of lines 


y= Ps and y= Pi y 
qi qj 
We know that 
aS and a-—|< ae 
qi Gidi+1 qj Fjj+i 
By Lemma 14.21 (for N > 1) we have 
= 1 1 
p(R,R) ae Cmax (—. ——), 
Gidi+1 FjVj+1 


where 
max (2,2(loe1| + |or|), o7-+023 + [on — a) 


(Jor — |) (Jor — |) 
Step 3. From the definition of best approximations and the result of Step 1 we have 


D 1 1 
p(R,Ry) < p(R,R) < Cmax (—_., ——). 
GiGit) FjQjui 
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In most cases (except for a small denominator NV) this estimate satisfies 


1 


RRS =, 
p( nv) 2\a — a| 


Choose 6; and 6) such that the lines of the best approximation Ry are y = (a + 6) )x 
and y = (Q + 6y)x. In the notation of Lemma 14.20 this means that Ry = R35, 5,. 
Now we apply Lemma 14.20 to write the estimate for 6; and 6: 


2(1+Ja1|)(a — a)? 1 1 
i < 2A LoD OF get), 


| 2] Gidi+1 FjVjr1 
2(1 + |a2|)(o4 — an)? 1 1 
|d2| < ( Jan) (1 2) “Cmax ( TT } 
| O41 | Gidit1 TV ja1 


where C is as in Step 2. 


Step 4. Suppose now that the lines of the best approximation Ry contains integer 
vectors (11,1) and (n2,mz) of unit integer length. By Proposition 14.28 we write 
the following estimates on the values of the Markov—Davenport form ®p at these 
vectors: 


@r(mj,n Q —|+6 m Q) — O|+6 

R ( Hy i 2s 1— O|+ 6) fo 1 |Ol, — | 1g. 
ny |O, — Q| n\ | — a| 

@r(mz,n2) _ | —O2|+ Pi: |) — O| +65 
ny a) — a| nz | 41 — Op 


Step 5. This is the last step. Here we should consider all integer arrangements that 
contains integer points (m',,n',) and (m/,n‘,) satisfying the inequalities obtained in 
Step 4 for ®r(m;,n;) for i= 1,2. Then one should choose the arrangement with the 
smallest discrepancy, which is Ry. 


Output. In the output we have the best approximation Ry. 


Remark 14.29. The above procedure of best approximation calculation seriously 
decreases the number of integer points in the square N x N that should be tested to 
find Ry. This is of a great importance especially for the case of algebraic arrange- 
ments. The procedure gives an explicit estimate on the number of w-sails whose 
points should be checked to construct best approximations. Since all w-sails are pe- 
riodic, all its points are contained in the finite number of orbits with respect to the 
corresponding Dirichlet group action. 


Example 14.30. Input data. Consider an arrangement of eigenlines of the Fi- 


bonacci operator: 
01 
11) 


Denote by F,, the nth Fibonacci number. 
Consider any integer N > 100. 
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Step 1. Consider a positive integer k such that Fy, < N < Fy41 and choose an approx- 
imation R with eigenspaces Fy_;y — x = 0 and hy + Fy_1x = 0. Then 


(04 Fi < a gai ! 
Fa | 7 FF’ YK Fy Fy 
Step 2. Since 1/(F,_1F) < 1/(55- 89), we have 
PENS 2/5 Fifi Js N22 ~ ND 
5+ a395 kW 3+ 3895 


Step 3. Now, following Lemma 14.20 we write the estimates on 6, and 6) of the 
best approximation Ry = R35, 6,: 


80.35 18.97 


|51| < N2 and |O2| < N2 


Step 4. Suppose that the best approximation Ry is defined by the two integer vec- 
tors (m,,n,) and (m,n) of unit integer length. Then we have the following two 
estimates: 


|Ba(m,n1)| _ 80.65 |Ba(m2,n2)| _ 18.99 
ne ye ny Saye 


Step 5. and Output. We have completed the computations for N = 10°, and the 
answer in this case is the matrix with eigenspaces Fy9y — F39x = 0 and F39y + Frox = 
0. We conjecture that for the Fibonacci matrix all the best approximations are the 
arrangements of lines F,_jy — Fx =O and hy+Fy_1x =0 fork =0,1,.... 


14.3 Exercises 


Exercise 14.1. Find all best approximations and all strong best approximations for 
the numbers 433/186 and 575/247. 


Exercise 14.2. Let AABC be an integer empty triangle. Consider a strip bounded 
by two lines parallel to AB and passing through the points A and C respectively. 
Prove that all integer points on this strip are at its boundary lines. 


Exercise 14.3. Consider a positive real number & with a regular continued fraction 
[ao;a1 : ---]. Let x be an integer satisfying 1 <x < ag, — 1. Prove that 


|[aosa1 i +++ 2 ae: x] —Q| > \laosai: +++ :ag:x+1]-al. 
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Exercise 14.4. Let @ = |ao;a) : ---] be a positive real number. Denote by p,;/qzx its 
k-convergent and by r, the remainder part [a,;a,41 : ---]. Then the condition 
He qk+1 
qk 


is equivalent to the condition 
(ag413Ak42 : vo] < [an413 Ak wees ao]. 


Exercise 14.5. Write Markov—Davenport forms for the arrangements R; and R2 of 
eigenlines of the operators 


os a te) 


respectively. Write the discrepancy between R; and R2. 


Exercise 14.6. Consider an arbitrary angle C with vertex at the origin whose edges 
do not contain other integer points. Prove that the w-sail of C is homothetic to the 1- 
sail of C and find the coefficient of homothety. What happens for the angles having 
integer points on their edges? 


Exercise 14.7. Let R be an algebraic arrangement of two lines. Then the following 
statements hold. 

(a). The Markov—Davenport form @p attains only a finite number of values at the 
integer points of the corresponding continued fraction. Denote this set by S. 

(b). The form @g attains the value x at some vertex of the w-sail if and only if 
x/wes. 


Exercise 14.8. Find all best approximations of sizes 1,2,3,4,5 for the arrangement 
of the eigenlines of the Fibonacci operator. 


| ® 
Check for 
updates 


Chapter 15 


Geometry of Continued Fractions with Real 
Elements and Kepler’s Second Law 


In the beginning of this book we discussed the geometric interpretation of regular 
continued fractions in terms of LLS sequences of sails. Is there a natural extension 
of this interpretation to the case of continued fractions with arbitrary elements? The 
aim of this chapter is to answer this question. 

In Sections 15.1 and 15.2 we introduce a geometric interpretation of odd or in- 
finite continued fractions with arbitrary elements in terms of broken lines in the 
plane having a selected point (say the origin). Further, in Section 15.3 we consider 
differentiable curves as infinitesimal broken lines to define analogues of continued 
fractions for curves. The resulting analogues possess an interesting interpretation in 
terms of a motion of a body according to Kepler’s second law. 


15.1 Continued fractions with integer coefficients 


In this section we give a formal extension of the notion of the LLS sequence for 
sails to the case of certain integer broken lines. Let us specify the class of broken 
lines we are dealing with. 

A broken line is called integer if all its vertices are integer points. We restrict 
ourselves to the case of broken lines all of whose triples of consecutive vertices are 
not in a line. 


Definition 15.1. Consider an integer broken line L in the complement to an integer 
point V. We say that L is at unit integer distance from V (or V-broken line, for short) 
if all edges of L are at unit integer distance from V. 


Remark. Without loss of generality, we mostly use only O-broken lines, where O is 
the origin. 


For arbitrary integer points A, B, and C, we consider the function of orientation 
sgn, namely 
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1, if the pair of vectors (BA, BC) defines the positive orientation, 
sgn(ABC) = ¢ 0, if the points A, B, and C are in a straight line, 
—1, if BA, BC defines the negative orientation. 


Let us generalize the notion of LLS sequences for sails to an arbitrary O-broken 
line. 


Definition 15.2. Consider an O-broken line AgA, ...Ay. Put by definition 


ag = sgn(AoVA1)1€(AoA1), 
a, = sgn(AgVA1)sgn(A;VA2)sgn(AoA1A2) lsin(ZA9A1A2), 
a2 = sgn(A1VA2)1¢(AjA2), 


An—3 = $gn(An—2VAn—1) sgn(An—1VAn) sgn(An—2An—1An) lsin(ZAn—2An—1An), 
a2n-2 = sgn(An—1VAn) 1€(Ap—1An). 


The sequence (ao,...,@2n—2) is called a lattice signed length-sine sequence for the 
the broken line AgA;...A,, or LSLS sequence, for short. 

The continued fraction for the O-broken line AjA, ...A, is the element [ap;a, :---: 
a2n-2| of R. 


Notice that the LSLS sequence for the sail of some angle coincides with the LLS 
sequence of this sail. So we consider LSLS sequences as a natural combinatoric— 
geometric generalization of LLS sequences. Note also that an O-broken line is 
uniquely defined by its LSLS sequence, the point Ag, and the direction of the vector 
AoAl. 

In Fig. 15.1 we show how to get signs of elements of the LSLS sequence from 
the local geometry of a broken line. As an example we consider the O-broken line 
with four vertices in Fig. 15.2. 

In the next theorem we state that the LSLS sequence is the complete invariant 
for the action of the group of proper integer congruences (i.e., integer congruences 
preserving the orientation of the plane). 


Theorem 15.3. (i) Two O-broken lines are proper integer congruent if and only if 
their LSLS sequences coincide. 

(ii) Every finite sequence of nonzero integers is realizable as an LSLS sequence for 
some O-broken line. 


We skip the proof of this theorem, since later, in the next section, we introduce a 
more general construction that generalize LSLS sequences for O-broken lines. For 
further information on O-broken lines and their LSLS sequences we refer to [102] 
and [104]. 
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ag = di 
ay = —1; 
ag = 2 
a3 = 2; 
ag = —]. 


Fig. 15.2, An O-broken line and the corresponding LSLS sequence. 


15.2 Continued fractions with real coefficients 


Let us describe a recent generalization of lattice properties of regular continued 
fractions to the case of arbitrary continued fractions (see also [106]). 

Further in this chapter we work in the plane with a marked point O (say the 
origin). For two vectors v and w in the plane we denote the oriented volume of the 
parallelogram spanned by the vectors v and w by det(v,w). 
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15.2.1 Broken lines related to sequences of arbitrary real numbers 


In Chapter 3 (see page 41) we gave a geometric algorithm to construct sails for 
real numbers given the regular continued fractions of the corresponding numbers. 
The algorithm has a sequence of positive integers (ao,a1,...) aS input and its output 
is the sails for the number [ao,a,...]. In this subsection we slightly modify this 
algorithm such that it will construct a broken line starting from an arbitrary sequence 
of nonzero real numbers. This algorithm will give us a base to define continued 
fractions related to broken lines (we do this later, in the next subsection). 


Algorithm to construct broken lines related to sequences of real numbers. 


Input data. Given a sequence (do,q1,...,@2n) of nonzero real numbers; the first 
vertex Ag; and the direction v of the first edge. 


Goal of the algorithm. To construct a broken line AjA, ... Ay. 


Step 1. Assign 
Aj => Ao + Ay, 
where A is the solution of the equation det(OA9,OA,) = ao. 


Inductive Step k. From previous steps we have the vertices Ao,...,Axz, for k > 1. 
Now let us find the vertex A,+1. First, we consider the point 


P=A,+ 


Az_1Ag. 
A2K-2 
Notice that the point P is on the line Ay_,A,z and the area of AOA;P equals 1/2. 


Secondly, we put 
QO = P+ dp,x_ | OAg. 


Thirdly, we define the point A; ;(see Fig. 15.3): 
Aggy = Ag + 2AxQ. 


This concludes Step k. 
Output. The broken line Ag...Ay. 


Remark 15.4. The constructed broken line in some sense generalizes geometric 
construction from the case of sails related to regular continued fractions (see The- 
orem 3.1 above) to the case of broken lines related to arbitrary continued fractions 
(see Theorem 15.10 and Corollary 15.12 below). 


Let us give a geometric meaning of ao,a),... in terms of oriented volumes for 
certain vectors defined by the above broken line. 


Proposition 15.5. Let (ao,a1,...,@2n) be a sequence of arbitrary nonzero elements. 
Consider any broken line Ag ...An constructed by this sequence as above. Then the 
following holds: 
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PQ = a2,0Ax; 
Ap Arti = 4241 AQ. 


Fig. 15.3 Construction of the point Ax+1. 


dx = det(OAx, OAx+1), k=0,...,n, 


a, a BeAr ARAKI) py 
a A2k-2A2 ~ 


Proof. Let us prove this proposition by induction on k. 


Base of induction. Directly from the first step of the construction we get 


det(OAg, OA;) =a. 


Inductive Step. Supposing that the statement holds for k—1, let us prove it for k. 
We start with a;: 


a2xdet(OAx_1, OAx) 


A2k-2 


det(OA,, OAx+1) = aoxdet(OA,, OQ) = axdet(OA,;, OP) = 


= a2. 


The last equality follows from the induction assumption. 

Now we check the expression for a2,_1. 

det(AcAc1,ArAnsi) _ det(AgAr—1,AxQ) 
A2k-242k a2K-2 


= PIT det(OAK_1, OAR) = a2x-1- 
ArK-2 


= det(PAx, PQ) = arx_1det(OA,, OP) 


The inductive step is complete. This concludes the proof. 


Example 15.6. Let us consider the first interesting case of broken lines with three 
vertices. Without loss of generality we fix Ag = (1,0) and the direction v = (0,1). 
Letting the sequence be (ao,a),a2), we have 


Aj it (1,a0). 
The corresponding points P and Q are as follows: 


P=(1,1+ao), Q = (1+a),1+a9+aoa1). 


182 15 Geometry of Continued Fractions with Real Elements and Kepler’s Second Law 


Finally, we have 
A2= (1 + aa2,a9 +42 +4041 a2). 


Notice that the coordinates of the point Az coincide with the denominator and the 
numerator of the continued fraction [a9;a, : ay]. 


15.2.2 Continued fractions related to broken lines 


Let us extend the definition of LLS sequences for sails to the case of broken lines, 
basing the definition on the expressions of Proposition 15.5. Consider an arbitrary 
broken line Ap ...A, satisfying the following condition: any of its edge is not con- 
tained in a line passing through the origin O. 


Definition 15.7. Define 


ar = det(OAx, OAg+1), k=0,...,n;3 
_ det(AgAg_1,AcAny1) 


a2gk-1 = Aap 24k 3 k=1,...,n. 
The sequence (ao,...,@2n) is called the LLS sequence for the broken line. The num- 
ber [ao;--+ : azn] is said to be the continued fraction for the broken line Ao...An. 


This definition extends the definition of LSLS sequences to O-broken lines. 


Proposition 15.8. Let a broken line be an O-broken line. Then its LLS sequence 
coincides with its LSLS sequence. 


Since the proof of this proposition is a straightforward calculation, we leave it to 
the reader. 


Finally, we show how the GL(2,R)-transformations act on the LLS sequences. 
Proposition 15.9. Consider two broken lines Ag...Ayn and Bo...Bn whose LLS 
sequences are (ao,...,@2n) and (bo,...,b2n) respectively. Suppose that there exists a 


GL(2,R)-operator taking the first broken line to the second one. Let the determinant 
of this operator equal 2. Then we have 


a2 = Abr, k=0,...,n, 
ay) = zbx-1, k=1,...,n 


Proof. The statement follows directly from formulas of Proposition 15.5, since the 
area of any parallelogram in the definition is multiplied by /. 


15.2.3 Geometry of continued fractions for broken lines 


Let us recall the notion of polynomials P, and Q; from Chapter 1. The polynomials 
P, and Q; are uniquely defined by the following three conditions: first, they are 
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relatively prime; secondly, 


Py. (x0, ster Xk) 


= (XOX i TX 
Ox(%04%++5%%) 


thirdly, 
P,(0,..-,0) +Q;,(0,...,0) = 1. 
Theorem 15.10. Consider an O-broken line Ag...An with LLS sequence 
(ag,a1 gene 42-2): 
Let also Ag = (1,0) and A, = (1,ao). Then 


An = (Qon—1(a0,41,--+,@2n-2),P2n—1(40,41,---,42n-2)). 


Remark 15.11. Similarly to Theorem 3.1 we have the following continuant expres- 
sion for Theorem 15.10: 


An = (Kon—2(a1,.--,@2n—2), Kon—1 (G0, 41... ,2n—2)). 


Proof. We prove this theorem by induction on the number of vertices in the broken 
line, i.e., in 71. 


Base of induction. Tf the broken line has two vertices Ag, A; and its LLS sequence is 
(ao), then A; = (1,ao). 


Step of induction. Assume that the statement holds for all broken lines with k ver- 
tices. Let us give a proof for an arbitrary broken line with k+ 1 vertices. Consider a 
broken line Ao... Ax. Let its LLS sequence be (ao,...,a2,). 


Let 
TH ie +1 se) ; 
—ao 1 
The transformation T takes A, to (1,0) and the line AzA; to the line x = 1. Under the 
transformation T the broken line Ag...A, is taken to another broken line, which we 
denote by BoB, ... By. Since the determinant of T equals 1, then by Proposition 15.9, 


the LLS sequence for BoB, ... Bx coincides with (ao,...,a2). By the induction as- 
sumption, the statement holds for the broken line B, ...B,. Thus we have 


By = (Qox-1(a2,..-,42k)sPor—1(a2,---,42x)). 
Letting B, = (q,p), then 
Ag = T~' (Bg) = (p +419, ap + (aoa; + 1)q). 


Therefore, we have 
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agp+(aoai+1)q _ i 1 Poey1(Go,a1,.--,@2%) 
=a4 = . 
p+aiq ath Qrx41(G0,41,-.- 42K) 


Now we substitute p = Py,_1(az,..., dx) and g = Qox_1(az,...,a2x). Let us calcu- 
late the value of the sum of the numerator and denominator in the leftmost fraction 
at the point O = (0,...,0): 


((aoPox—1 + (aoai+1)Qox—1) + (Pox—1 +41 Q2x-1)) (O) = (Por—1 + Qox-1)(O) = 1. 


The first equality holds since we take ay = a; = 0. The second equation holds by 
definition of P2,_; and Q>,_,. Therefore, 


Pyp41 (40,415 + ++, 42k) = GoPrx—-1(A2,.-- 42x) + (aoay + 1)Qrx-1(a2,..-, 2%), 
Orx+1(G0,41,--+,G2x) = Pox—1(a2,---, 42) +41 Ooxn-1(a2,.--, ax). 


This concludes the proof for the broken line Ag...Az. The induction step is com- 
plete. 


Theorem 15.10 extends the geometric interpretation of regular continued frac- 
tions to the case of arbitrary continued fractions in the following way. 


Corollary 15.12. Consider a broken line Ag...Ay, with Ao = (1,0), Ai = (1,40), 
and Ay = (x,y). Let & = [ag;a1 : +++: dan] be the corresponding continued fraction 


for this broken line. Then 
y 


x 


=a. 


For the case of an infinite value for the continued fraction for the broken line, x/y = 


0. 


Remark 15.13. Supposing that the sequence of numbers (ao, d@1,...,@2,) in Corol- 
lary 15.12 contains only positive integers, the broken line Ag...A, coincides with 
the sail of the angle defined by rays y = 0 and y = ax in the first quadrant of the 
plane. In particular, Ap ...A, is in the boundary of a convex set and its LLS sequence 
coincides with the LLS sequence of the sail for the angle ZA9OAn. 


From Corollary 15.12 we get the following statement. 
Corollary 15.14. Consider two broken lines Ag ...An and Bo... Bm with LLS sequen- 


ces (aQ,@1,---,;@2n) and (bo,b1,...,b2m) respectively. Let By = Ag and the vector 
AoA, coincide with the vector 7. BoBi. Suppose that 


[ao;-++ : Gan] = [bo;-+ : Dam]. 
Then the points Ay, Bm, and the origin O are contained in one line. 


Proof. Consider a transformation in the group SL(2,R) taking the point Ao to (1,0) 
and A; to some point on the line x = 1. It takes both broken lines to some broken 
lines with Bp = Ao = (1,0) and the points A; and B; to some points on the line x = 1. 
By Proposition 15.9 this operator does not change the continued fraction. Now we 
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[1;-2:2:-3/5:-5] = 5} 


Fig. 15.4 Two examples of broken lines with three edges. 


are in a position to apply Corollary 15.10: the images of the points A, Bj», and the 
origin are on a line. Therefore, the points A,,, B,,, and O are on a line. 


Knowing the continued fraction for a broken line, it is possible to say whether 
the corresponding broken line is closed. 


Corollary 15.15. (On necessary and sufficient conditions for broken lines to be 
closed.) Consider a broken line AgA, ...An with the LLS sequence (ao,d\,..-,@2n): 
Let Ag = (1,0) and AoA be on the line x = 1. Then the broken line is closed if and 


only if 
Qon+1(a0,41,- asa ,42n) =1 and Pon+1 (0,41, Pew ,d2n) =0. 
The condition P,41; = 0 is equivalent to the following one: 
[a03@1 2 +++ 2 don] = 0. 
Notice also that for an arbitrary broken line the conditions are almost the same, as 
seen from Proposition 15.9. 


Example 15.16. Let us write the conditions for a broken line consisting of three 
edges to form a triangle. Suppose that the LLS sequence of this broken line is 
(ao,@1,42,43,a4). Then the conditions are introduced by the following system: 


a041a203a4 + apd) a2 + aga,a4 + aga3a4 + a2a3d4 +.a9 +d2 +a4 = 0, 
ajaza3a4+a)a2 +a\a4+a3a4+1=1. 
In Fig. 15.4 we show two examples of broken lines with three edges. 
We conclude this section with the following open problem. 


Problem 8. Consider a broken line L and a proper Euclidean transformation T (the 
origin may not be preserved by 7). Find the relation between the elements of the 
LLS sequences of the broken lines L and T(L). 
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Due to Proposition 15.9 it is sufficient to solve this problem only for translations 
by a vector. 


15.2.4 Proof of Theorem 4.16 


First let us prove the following statement. 
Proposition 15.17. Consider an integer angle ZAOB with vertex at the origin O 
and with edges passing through A = (1,0) and B = (1,a), respectively, where a 
satisfies 0 < a < 1. Let the odd regular continued fraction for a be 
a= (0; ay eee : A2n_2]. 
Then the LLS sequence of the angle ZAOB is 
(a2,..-,42n—2). 


Proof. The congruence is provided by the multiplication by 
la 
O1/)}° 


We have finally collected all necessary ingredients to to prove Theorem 4.16. 


Proof of Theorem 4.16. Recall that we are given by two linearly independent integer 
vectors 
OA=(p,q) and OB=(r,3s). 


The proof for the statement of Theorem 4.16 is given by the study of numerous 
straightforward cases of various signs for p,g,r,s, and det(OA, OB). Let us study the 
case p,q,1,5 > 0,det(OA, OB) < 0 (we omit the studies of all the other cases here). 


In this case two sequences of integers 
(ao, a1, sae 42m) and (bo,a1, eee , bon) 


are defined as the sequences of elements of odd regular continued fractions of ratio- 
nal numbers 


s 
; = [ao;a, :...: Gam] and on [bo;a1:...: ban]. 


Denote by E the coordinate vector (1,0). Consider the broken line L that is a 
concatenation of the following two sails. First we take the sail of the angle ZAOE 
(in the case where the last edge of this sail is not vertical we formally add the in- 
finitesimal edge EE with vertical direction and 0 integer length) and the sail for the 
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OY 


OE OX 


Fig. 15.5 The broken line corresponding to the concatenation of the sails for ZAOE and ZEOB. 


angle ZEOB (similarly another infinitesimal edge EE is added in the case where 
the first edge of the sail of the angle is not vertical). See Fig. 15.5. 


The broken line L has the following properties: 

— its first vertex is in the ray OA and the last vertex is in the ray OB; 

— the direction of its first edge is inwards (with respect to the angle). 

Then the angle is integer congruent to the angle ZEOC with C = (1,@) where 
|o:| is defined by the LLS sequence of the above broken line as 


O = [Edam : Ed2m—1 2 +++ EQ 2 Eg: 0: bb: bby: +++: Oba]. 


In case of p,q,r,s > 0,det(OA, OB) < 0, the first part of the broken line L will be 
the sail (considered as a broken line) of ZAOE passed clockwise. Thus the elements 
of its LLS sequence are reversed and negative to the values of the LLS sequence 
for ZAOE (i.e., € = —1). Note that in the case where q/p < 1 we end up with an 
infinitesimal (zero length) vertical vector which additionally brings two elements: 
|p/q| for the angle with the vertical line passing through E and 0, indicating that 
we stay at E. Then we switch to the second sail. Both sails are starting vertically 
(or asymptotically vertically in the case where a or bj are zeroes), hence the angle 
between the edges corresponding to ag and bo is zero. By that reason we should add 
a zero element to the LLS sequence for L here. Further we go back following the 
sail of the angle ZEOB (in this case 6 = 1), which is described by the continued 
fraction 

[bo : by 2 +++ bay] 


(here again we formally add bp = 0 and db; = |s/r| if r/s < 1). Therefore the LLS 
sequence of the broken line L is 


(—d2m,—@2m-1,- : -,—41, —a0,0,b0,b1,. eo ,ban). 


Now let us define a as 
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O = [—d2m 2 —d2m—1 +++ —A, 2 —a9 20: bg 2 by 2 +++: ban] 
and let the regular continued fraction for || be as follows 
|a| = [coscy 2 +++ 2 Cox]. 
From Proposition 15.17 we have 


Itan ZEOC = [cose C2: +++: 2x], ifco £0; 
[ca3-++ cox],  ifco =O. 


Therefore, the LLS sequence of ZAOB © ZEOC is either (co,c1,...,C2x) in the case 
where co 4 0 or S = (c2,...,C€2%) in the case where co = 0. This concludes the proof 
of the case p,g,r,s > 0,det(OA, OB) <0. 

The cases for the other choices of signs for p,g,r,s and det(OA, OB) are consid- 
ered similarly, we omit them here. 


15.3 Areal and angular densities for differentiable curves 


Let us go even further and consider a differentiable curve instead of a broken line. 
In this situation it helps to think of curves as broken lines with infinitesimally small 
segments. In some sense the LLS sequence “splits” into a pair of functions related to 
the odd and even elements of the LLS sequence. Further, we call these two functions 
areal and angular densities. In this section we discuss some basic properties of areal 
and angular densities, in particular we shall show that the areal density is the inverse 
to the velocity of a body that moves according to the second Kepler law. 


15.3.1 Notions of real and angular densities 


Throughout this section we suppose that all curves are of class C? and have the 
arc-length parameterization. 


Definition 15.18. Let y be an arc-length parameterized C?-class curve. By definition 
the areal density and the angular density at some point t are 


A(t) = lim sont. Ov(t +£)) 


= det(Oy(t), 7(t)) 


and 
ies det(y(t) 7 — €), YO + €)) 
e0 Edet(Oy(t — €), Oy(t)) -det(Oy(t), Oy(t+e)) 


We continue with the following geometric interpretations for the function A. 


B(t) 
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Proposition 15.19. (Areal density and the Kepler’s second law.) Suppose that a 
body moves along the curve Y with velocity 1/A. Then the sector area velocity of the 
body is constant and equals 1. 


The proposition follows directly from the definition. 


Let us show that the value of A7B at a point t coincides with the signed curvature 
at t (which we denote by «(f)). 


Proposition 15.20. Let the vectors Oy(t) and (t) be noncollinear at some moment 
t. Then the following holds: 


Proof. By definition we have 


A?(t)B(t) = 

: (eeoeen) det(y(t)y(r—€), y(t) y(t +€)) 

e-30 é edet(Oy(t — €), OY(t)) -det(Oy(1), OV(t + €)) 
— tim CMON = €), YOM + €)) 

€>0 e3 : 


Let us rewrite the curve yin coordinates, i.e., y(t) = (x(t), (t)). Since the parameter 
t is arc-length we have 


IYML= Ve OP+0'@P=1 and w(t) =x'(e)y"(e) — yO"). 


Let us consider the Taylor series expansions 


y¥'(t) + terms of higher order, 


V(t+e) = V(t) +e7 (1) 


y(t—e€) = y(t) -ey (t) + <1) +terms of higher order. 


Let us substitute 


= 
= 
= 
= 
~ 
tap) 
a 
I 


7( 


ex’ + Beet yey + ay ) +terms of higher order, 
y(t)y(t —€) = (ex + ex, ey + Sy") + terms of higher order. 


Hence we have 


€3(x'y" — yx") + 4th order terms 
m 


lim =l 
e0 ro €0 e3 
___ €>«(t)+4th order terms 
= lim 5 
e0 E 
= k(t). 


Therefore, A*(t)B(t) = K(t). 
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Let us prove the theorem on existence of a curve with a given angular density. In 
some sense this theorem is a smooth analogue of the algorithm to construct a broken 
line by the elements of the corresponding continued fraction of Section 15.2.1. It is 
interesting to observe that to reconstruct a curve it is necessary to know the areal 
density (which corresponds only to odd elements in the LLS sequences for broken 
lines). 


Theorem 15.21. Suppose that we know the areal density A(t) smoothly depending 
on a parameter t in some neighborhood of to, the starting position (to), and the 
origin O. 

— If |A(to)| > |Or(t) 
tions 

— If |Oy(to)| > |A(to)| > 0, then the curve y is uniquely defined in some neigh- 
borhood of the point (to). 


, then there is no finite curve satisfying the above condi- 


Proof. In polar coordinates (r,@) with center at O, the curve y is defined by the 
following system of differential equations: 


PO=A, 
P+rP@ =. 


This system is equivalent to the union of the following two systems: 


By the main theorem of ordinary differential equations (e.g., see [10]), each of these 
two systems has a finite solution if and only if |r| > |A| > 0. 


15.3.2 Curves and broken lines 


There are many interesting questions related to the convergence of broken lines to 
smooth curves in the context of areal and angular densities. Let us first show one 
particular result in this direction. 

Let y(t) be a curve with arc-length parameter t € [0,7] and densities A(t) and 
B(t). In addition we suppose that the vectors Oy(t) and Y(t) are linearly independent 
for all admissible values of t. For a positive integer n we denote by y, the broken 
line Von... Van with vertices 


van r(i7) 


Suppose that the LLS sequence of 7; is (aon, ve ,42nn)- Consider the following two 
functions A, and B,: 


An(t) = 4o\nt/T)41n ANd Bat) = ag) n/n 
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These functions satisfy the following convergence theorem. 


Theorem 15.22. Let y be a C?-curve with arc-length parameterization. Then the 
sequence of functions (An) converges pointwise to the function A, and the sequence 
of functions (B,) converges pointwise to the function B. 


Proof. This follows directly from the definition of density functions and the prop- 
erties of LLS sequences shown in Proposition 15.5. 


We continue with two open problems. The first one is in some sense an inverse 
problem to the statement of Theorem 1.16. 


Problem 9. Let a sequence of broken lines L; converge pointwise to a C?-curve 7. 
Study the relations between LLS sequences of the broken lines L; and their “limits” 
from one side and the density functions of y from the other side. 


Notice that it is not clear what kind of limits for the elements of LLS sequence 
one should use here. 


Finally, it is interesting to have some criterion for the curve to be closed, as was 
done for broken lines in Corollary 15.15. 


Problem 10. What are the conditions on the functions A(t) and B(t) for the resulting 
curve Y to be closed? 


15.3.3 Some examples 


Let us write down the areal and angular densities for straight lines, ellipses, and 
logarithmic spirals. 


Lines. Let O be the origin and y the line x = a. Then we have 


A(t) =a and B(t) =0. 


Ellipses and their centers. Consider the ellipse xs + y = 1 witha >b>0 and 
let O be the origin, i.e., at the symmetry center of the ellipse. Then we have the 
following expressions for the areal and angular densities: 


ab 1 


A(t) = and B(t) = : 
Va? sin2t + b2 cost ab\/ a2 sin2t + b2cos2t 


Notice that the ratio of the functions A(t) and B(t) is constant: 


BD 


BO) re 
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Ellipses and their foci. Consider again the ellipse % + s =1witha>b>0. Now 


let O be the point (— Va? — b*,0), which is one of the foci of the ellipse. Then the 
areal and the angular densities are as follows: 


A() = ab+bV/a —b cost 


Va@2sin2t + b2cos2t 


and 
a 


7 by/@ sin? + b2.cos?t(a+costVa2—b?)” 


B(t) 


Remark on Keplerian planetary motion. Consider the Sun and some planet that 
moves around the Sun. Then the planet moves according to Kepler’s laws: 


I. The orbit of the planet is an ellipse with the Sun at one of the two foci of the 
ellipse. 

II. The motion has constant sector area velocity. 

III. The square of the orbital period of the planet is proportional to the cube of the 
semimajor axis of its orbit. 


The trajectory of the planet is an ellipse defined by the equation xs + ie =1witha> 
b > 0 in some Euclidean coordinates (according to Kepler’s first law), in addition, 
the Sun is one of the foci. Then according to Kepler’s second law, the planet moves 
with velocity A /A(t) for all t. The constant A is defined from Kepler’s third law: 


where by T we denote the orbital period of our planet, and by 7, and a¢, respectively, 
the orbital period and the semimajor axis for Earth. Denote by L the length of the 
ellipse for the planet, i.e., 


n/2 
b2 
sada f ft—(1- 2) cote 
a 


0 


Since T = |A| fi |1/A(t)|dt, we have 


3 


SHO ae 


Similar situations hold for parabolic and hyperbolic motions. 


Logarithmic spirals. Consider the logarithmic spiral 


{ (ae” cost,ae™ sint) |t € R}. 


15.4 Exercises 193 


Then the areal and angular densities are expressed as follows: 


bt ~3bt . /p2 

Vb-+1 

A(t) = “<— and B(t)= a Se 
Vbe+1 a 


Observe that ; 


A} (t)B(t) = —— 
(NBO) = po, 
meaning that for logarithmic spirals the function A>B is constant. 


Remark 15.23. When we consider lines, ellipses, and spirals we have certain re- 
lations between densities A and B: the functions A, A/B, and A>B respectively are 
constant. Notice that if A2B is a constant function, then the curvature is constant, 
and hence we get circles. It is natural to study what kind of curves are defined by 
certain relations between the densities. For instance, the following question remains 
open: what curves do we get if AB (or simply B) is constant? 


15.4 Exercises 


Exercise 15.1. Find a proof of Proposition 15.8. 


Exercise 15.2. Draw a broken line on five vertices whose LLS sequence satisfies 
(a) all elements are positive; (b) all elements are negative; (c) even elements 
are positive and odd elements are negative; (d) odd elements are positive and 
even elements are negative. 


Exercise 15.3. Calculate the areal and angular densities for 
(a) straight lines; (b) ellipses; (c) logarithmic spirals. 


Check for 
updates | 


Chapter 16 
Extended Integer Angles and Their Summation 


Let us start with the following question. Suppose that we have arbitrary numbers a, 
b, and c satisfying 
a+b=c. 


How do we calculate the continued fraction for c knowing the continued fractions 
for a and b? It turns out that this question is not a natural question within the theory 
of continued fractions. The simplest algorithm works as follows: 

— first calculate the rational number representations for a and b; 

— add them according to basic school arithmetic; 

— write the continued fraction for the sum. 
One can hardly imagine any law to write the continued fraction for the sum directly. 
The main obstacle here is that the summation of rational numbers does not have 
a geometric explanation in terms of the integer lattice. In this chapter we propose 
to consider a “geometric summation” of continued fractions, which we consider a 
summation of integer angles. 


In Section 16.1 we start with the notion of extended integer angles. These an- 
gles are the integer analogues of Euclidean angles of the type kz + @ for arbitrary 
integers k. We classify extended angles by writing normal forms representing all of 
them. Finally, we define the M-sums of extended angles and integer angles. Fur- 
ther, in Section 16.2, we show how the continued fractions of extended angles are 
expressed in terms of the corresponding normal forms. Finally, in Section 16.3 we 
give a proof of Theorem 6.9(i) on the sum of integer angles in integer triangles, 
which is based on the techniques introduced in this chapter. 


16.1 Extension of integer angles. Notion of sums of integer angles 


We start with a few general definitions. An integer affine transformation of the plane 
is said to be proper if it preserves the orientation of the plane. We say that two 
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sets S; and Sj are proper integer congruent if there exists a proper integer affine 
transformation of R? taking the set S; to the set S2, which we write as S$; =S>. 


16.1.1 Extended integer angles and revolution number 


16.1.1.1 Equivalence classes of integer oriented broken lines and the 
corresponding extended angles 


We say that an integer oriented broken line A,Ay_1An—2...Ao is inverse to the inte- 
ger oriented broken line AjpA1A2...An 


Definition 16.1. Consider an integer point V. Two integer oriented V-broken lines 
1, and I (see Definition 15.1) are said to be equivalent if their first and their last 
vertices coincide respectively and the closed broken line generated by /; and the 
inverse of /» is contractible in R* \ {V}. 


Definition 16.2. The equivalence class of integer oriented V-broken lines contain- 
ing a broken line AgA;...A, is said to be the extended integer angle for the bro- 
ken line AgA,...Ay with vertex V (or for short, extended angle). We denote it by 
Z(V,A0A1.--An)- 


The set of extended angles is invariant with respect to the action of (proper) 
integer affine transformations. Therefore, (proper) integer congruence for extended 
angles is well defined. 


16.1.1.2 Revolution numbers for extended angles. 


Let v be an arbitrary vector, and V an integer point. Consider the ray 
r={V+Ap|A > 0}. 


Let AB be an arbitrary (oriented from A to B) segment not contained in r. Suppose 
also that V is not in AB. We denote by #(7, AB) the number 


0, AB does not intersect r, 
#(r,AB) = }sen(A(A+»)B) , ABN re {A,B}, 
sen(A(A+v)B) , ABO r€ AB\ {A,B}, 


and call it the intersection number of the ray r and the segment AB. 


Definition 16.3. Let AoA; ...A, be a V-broken line, and let r be aray {V+Av|A > 
0}. We call the number 


n 


y) #(7,4;-14)) 


i=l 
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the intersection number of the ray r the V-broken line AoA; ...A, and denote it by 
#(r,A0A, ...An). 
Definition 16.4. Consider an arbitrary extended angle 7(V,A9A 1 ...A;,). Define 


re ={VHAVAg|A>O} and = r_ = {V—AVAg|A>O}. 


The number 


1 
5 (#(r4.,A0A1-.-An) +#(r_,AoA1..-An)) 


is called the revolution number for the extended angle 7(V,AgA,...Ay). We denote 
it by #(Z(V,A0A1...An)). Additionally, we put by definition #(Z(V,Ao)) = 0. 


Example 16.5. Let O = (0,0), A = (1,0), B = (0,1), and C = (—1, —1). Then 
1 
#(Z(0,A))=0, — #(2(0,AB)) = 7, 
3 
#(Z(O,ABCA)) = 1, #(2(O,ACB)) = ae 
Let us show that the definition of the revolution number is well defined. 
Proposition 16.6. The revolution number of any extended angle is well defined. 
Proof. Consider an arbitrary V-broken line and the corresponding extended angle 
Z(V,A0A1..-An). Let 
re ={V+AVA|A > 0} and r_={V—AVAd|A > 0}. 


By definition, every segment of the broken line AgA, ...A,, is at unit integer distance 
from V. Hence this broken line does not contain V, and the rays r, and r_ do not 
contain edges of the broken line. 


Suppose that 
Z(V,AoA1.--An) = Z(V',ApA) ee Al). 


Let us show that 
#(Z(V,A0A1.--An)) = #(Z(V' ADA eA is 


From the definition we have V = V’, Ag = AO, An = A), and the broken line 
AoA ..-AnA!,_1...A4Ap is contractible in R? \ {V}. This implies that 
HAW aod ..-An)) —#(Z(V,' AGA} ---Aln)) 
— 5 (#(r4 AoA . AnAl,_1-.-A{AQ) +#(r_,A0AL.--AnAjn_1 ---A}A0)) 
=04+0=0 


(we leave to the reader the proof of the second equality; see Problem 16.2). Hence, 


#(Z(V,A0A1..-An)) =#(Z(V’,ApA1..-Am))- 
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Therefore, the revolution number of any extended angle is well defined. 


Remark. Notice that the revolution number of a closed V-broken line coincides with 
the degree of a point V with respect to the V-broken line. 
Let us formulate one important property of the revolution number. 


Proposition 16.7. The revolution number of extended angles is invariant under 
proper integer congruences. 


Finally, we formulate a nice expression for the rotation number of a closed inte- 
ger broken line with integer vertices not passing through the origin introduced in a 
recent preprint [86] by A. Higashitani and M. Masuda. Recall that the rotation num- 
ber about the integer point V of a closed broken line L not passing through V is the 
degree of the projection of L to the unit circle along the rays with vertex at V. We 
denote the rotation number by Roty (L). Notice that a rotation number for V-broken 
lines coincides with the revolution number of the corresponding extended angles. 


Proposition 16.8. Consider a closed integer V-broken line L and enumerate all 
of its integer points A,,...,Aq (not only vertices). In addition, set Ay = Aq and 
Ag41 =A}. Then we have 


1 1. 
Roty (L) = 4 Yat 12 yi ai, 
i=l i=l 


where €; = det(VA;, VA;+1) and a; is defined from the equation 


€;-1VAj_-1 + &\VAj41 +a;VA; = 0. 


For the proof of this proposition we refer to [86] and [226]. 


16.1.1.3 Zero integer angles 


In the next theorem we use zero integer angles and their trigonometric functions. 
Let A, B, and C be three integer points on a straight line. Suppose that B is distinct 
from A and C and that it is not between A and C. We say that the integer angle ZABC 
with vertex at B is zero. Further, we put by definition 


Isin(ZABC) =0, Icos(ZABC)=1,  Itan(ZABC) =0. 


Denote by larctan(0) the angle ZAOA, where A = (1,0) and O is the origin. 
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rg) nA one is z 
Typel | °°: -———— Z(0,Ao) = 0x + larctan(0), 
: LSLS sequence is (). 


Tye I: Az’ ed eae oias! Z(O,AoA1A2) = 1 + larctan(0), 
sf ..Q.Agp... LSLS sequence is (1,—2, 1). 
“Ag 


ane a ‘ “Age Ay Z(0,AoA1A2A3Ag) = 27 + larctan((0), 
a . . . . |LSLS sequence is (—1,2,—1,2,—1,2,-1). 


Z(0,A0A1A2A43A4) = @ + larctan 3, 


Type Vis LSLS sequence is (1,—2,1,—2,1, 1,1). 


Z(0,A0A1A2A3) = —7-+larctan3, 


DreNG LSLS sequence is (—1,2,—1,2,3). 


Fig. 16.1 Examples of normal forms. 


16.1.2 On normal forms of extended angles 


While working with an abstract definition of extended angles it is useful to keep in 
mind particular broken lines that characterize the corresponding equivalence classes, 
which we shall call the normal forms of these broken lines. 

Denote a sequence 


(d0,-++;€ny;Q0,++-;4n, -++5  0;-++,4n,bo,---,;Dm). 
—_—_— 
k times 


by 
{ (iiss Sasi) 5 Dowveveala) 


Definition 16.9. (Normal forms of extended angles.) Consider an integer oriented 
O-broken line AgA; ...As, where O is the origin. Let Ap = (1,0) and (if s > 0) let Ay 
be on the straight line x = 1. 

(I) We say that the extended angle 7(O,Ao) is of Type I and denote it by Oz 4 
larctan(0) (or 0, for short). The empty sequence is said to be characteristic for 0 + 
larctan(0). 


T 


If the LSLS sequence of the broken line AgA, ...As coincides with one of the follow- 
ing sequences (we call it the characteristic sequence for the corresponding angle), 
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then: 

(II,) If ((1,—2,1,-2)*-!,1,-2,1), where k > 1, then we denote the angle Bo by 
ka-+larctan(0) (or kz, for short) and say that Bo is of Type Ik; 

(III;,) If ((—1,2,-1,2)*"!,-1,2,-1), where k > 1, then we denote the angle Bp 
by —ka-+larctan(0) (or —kz, for short) and say that Bo is of Type TM; 

(IV,) If ((1,-2,1,—2)*,ao,...,a2n), where k > 0, n > 0, a; > 0, for i=0,...,2n, 


then we denote the angle ®p by km+ larctan([ao; a1 : +++: d2n]) and say that Bp is of 
Type IV;; 

(V;) If ((—1,2,-1,2)*,ao,...,@2n), where k > 0, n > 0, a; > 0, for i =0,...,2n, 
then we denote the angle ®) by —ka+larctan([ag; aq : +++ : d2,]) and say that Bp is 
of Type Vx. 


See examples of several normal forms in Fig. 16.1. 


Theorem 16.10. For every extended angle ® there exists a unique normal angle 
of Definition 16.9 that is proper integer congruent to ®. (This angle is called the 
normal form of ®.) 


We start the proof with the following lemma. 


Lemma 16.11. Consider integers m, k > 1, and a; > 0 fori=0,...,2n. 
(i) Suppose that the LSLS sequences for the extended angles ®, and ®y are respec- 
tively 
((1,—2,1,-2)*-',1,-2,1,-2,a9,...,@2n) and 
((1,—2,1,-2)*-',1,-2,1,m,ao,...,@2n). 


Then ®, is proper integer congruent to Py. 
(ii) Suppose that the LSLS sequences for the extended angles B, and ® are respec- 
tively 
((—1,2,—1,2)*-',-1,2,-1,m,ao,...,d2n) and 
((—1,2,-1,2)4-!, -1,2,-1,2,a0,...,@2n). 


Then ®, is proper integer congruent to Py. 


Proof. We start the proof with the first statement of the lemma. Without loss of 
generality we assume that the vertices of the extended angles ©; and ®) are at the 
origin, say 

D, = Z(0,Ao..-Ark+n+1); 

D, = Z(O, Bo... Box n 1) 


Additionally we assume that Ag = Bo = (1,0) and the points A; and Bz are on the 
lines x = 1 and (m+ 1)y = x respectively. These conditions together with the LSLS 
sequences uniquely identify the vertices 


Ax = ((-1)!,0), for 1 <k—1, 
Aoi41 = ((-1)',(-1))), for? < k-1, 
Ax = ((-1)f,0), 

Ari = ((—1)*, (—1)*ap), 


and 
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By, = ((-1)/,0), for 1 <k—1, 
Boi41 = ((-1)!(—m—1),(-1)'), for! < k-1, 
By = ((-1)*,0), 

Bogs = ((-1)*, (-1)fao). 


Hence Ar, = Boy and A2x41 = Box41. Notice that the remaining parts of both LSLS 
sequences (i.e., (49,.--,@2,)) coincide. Therefore, the point A; coincides with the 
point B, for / > 2k. 

On the one hand, it is clear that the integer oriented broken line 


Ao... Arg (=Bx)By_1Be_2 ...Bo 


is contractible. On the other hand, A; = B; for 1 > 2k. Therefore, the angle ®, is 
proper integer congruent to @). This concludes the proof of Lemma 16.11(i). 


The proof of Lemma 16.11(ii) is similar, and we leave it as an exercise for the 
reader. 


Proof of Theorem 16.10. Uniqueness of normal forms. Let us first show that the 
extended angles listed in Definition 16.9 are pairwise proper integer noncongruent. 


Consider the revolution numbers of the extended angles listed in Definition 16.9: 


Types TM, (k > 0)[ MM, (k = 0) JV; (k = 0) Vi(k > 0) 
Revolution numbers||0| 1 /2(k+1) |—1/2(k+1)} 1/44+1/2k|1/4—1/2k 


Therefore, the revolution numbers of extended angles distinguish the types of the 
angles. 


For Types I, I;, and II, the proof of uniqueness is complete, since any such 
type contains exactly one extended angle. 


Now we prove that any two normal forms of Type IV; are not proper integer 
congruent for any integer k > 0. Consider the extended angle 


® = ka-+larctan([ao;a1 : +++ : azn]) 


of Definition 16.9. Assume that the O-broken line for the angle ® is AoA, ...Am, 
where m = 2|k|+-n+1. Notice that the LSLS sequence for this broken line is char- 
acteristic for @. We consider the two cases of odd and even k separately. 

If k is even, then the integer angle ZAjOA,, is proper integer congruent to the 
integer angle larctan((ao;a1 : --- : do,|). The integer arctangent is a proper inte- 
ger affine invariant for ®. This invariant distinguishes all the extended angles of 
Type IV; with even k. 

Suppose now that k is odd. Set B = O+A00. The integer angle BVA, is proper 
integer congruent to larctan({ag;a, : --- : do,]). The integer arctangent is a proper 
integer affine invariant for the extended angle ®, distinguishing extended angles of 
Type IV; with odd k. 
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The proof of uniqueness of normal forms of Type V; repeats the proof for normal 
forms of Type IV;. 


Therefore, the extended angles listed in Definition 16.9 are not proper integer 
congruent. 


Existence of normal forms. Now we prove that every extended angle is proper inte- 
ger congruent to one of the normal forms. 

Consider an arbitrary extended angle ® = Z(V,A0A1...An). If #(®) = k/2 for 
some integer k, then ® is proper integer congruent to an angle of one of the 
Types I—III. If #(®) = 1/4, then the extended angle @ is proper integer congruent 
to the extended angle defined by the sail of the integer angle ZAgVA,, of Type IVo. 


Suppose now that #(®) = 1/4+k/2 for some positive integer k. Choose the bro- 
ken line defining ® whose LSLS sequence is of the following form: 


((1,-2,1,-2)*1,1,-2,1,m,ao,...,4@2n), (16.1) 


where a; > 0, for i=0,...,2n. Notice that it is always possible to construct such a 
broken line: first, construct the broken line for 


20 429) Al 
( ) 


Secondly, take the sail of the corresponding angle. The union of the broken line 
and the sail is a broken line whose LSLS sequence is exactly as in (16.1). By 
Lemma 16.11 the extended angle ® defined by such an LSLS sequence is proper 
integer congruent to the extended angle of Type IV; defined by the sequence 


((1,-2,1,-2)*-',1,-2,1,-2,ao,...,@2n), 


which is in the list of normal forms. 


Finally, we consider the case #(®) = 1/4—k/2 for some positive integer k. As in 
the previous case there exists a broken line defining @ whose LSLS sequence is 


((-1,2,-1,2)*1,-1,2,-1,m,ao,...,a2n), 


where a; > 0, fori =0,...,2n. By Lemma 16.11 the extended angle defined by this 
sequence is proper integer congruent to the extended angle of Type V; defined by 
the sequence 

((—1,2,-1,2)*',-1,2,-1,2,a0,...,42n), 


which is again in the list of normal forms. 
This completes the proof of Theorem 16.10. 
Let us reformulate Theorem 16.10 in the following way. 


Corollary 16.12. Two extended angles are proper integer congruent if and only if 
they have the same normal form. 
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16.1.3 Trigonometry of extended angles. Associated integer angles 


Having the list of normal forms, we can extend the trigonometric functions to the 
case of extended integer angles. 


Definition 16.13. Consider an extended angle ® with the normal form kz+@ for 
some integer (possible zero) angle @ and for an integer k. 

(i) The integer angle @ is said to be associated with the extended angle ®&. 

(ii) The numbers Itan(@), Isin(@), and lcos(@) are called the integer tangent, the 
integer sine, and the integer cosine of the extended angle ®. 


There exists a canonical embedding of the set of all integer angles into the set of 
all extended angles, since every sail is an integer broken line at distance one from 
the vertex of the angle. 


Definition 16.14. Consider an arbitrary integer angle @. The angle 
Ox + larctan(Itan @) 


is said to be corresponding to the angle ~, which we denote by @. 


From Theorem 16.10 it follows that for every integer angle @ there exists a 
unique extended angle @ corresponding to @. Therefore, two integer angles @; and 
(2 are proper integer congruent if and only if the corresponding extended angles @, 
and @, are proper integer congruent. 


16.1.4 Opposite extended angles 


As in Euclidean geometry, in integer geometry every extended angle possesses an 
opposite angle. 


Definition 16.15. Consider an extended angle 
® = Z(V,Ao0A1..-An). 


The angle 
Z(V,AnAn—-1---Ao) 
is said to be opposite to ®, which we denote by —®. 


Let us write the normal form for the opposite extended angle. 


Proposition 16.16. For every extended angle ®&kn+@ we have 


@®2(—-k-1)4+(a-9@). 
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16.1.5 Sums of extended angles 


In the next definition we introduce sums of integer and extended angles. A sum of 
two angles is not uniquely defined, in contrast to Euclidean geometry. 


Definition 16.17. Consider arbitrary extended angles ®;, where i=1,...,/ for! > 1. 
Let the normal forms of the ®; have the characteristic sequence 


(40,1,41,i,-»-;42n;,i) fori=1,...,/. 


Let M = (m,...,mj_) be an arbitrary (J—1)-tuple of integers. Construct an ex- 
tended angle ® with characteristic sequence 


(401,411) +++ »42n,,15171,40,2, ee »42ny,2,2, “ot »M-1,40,1;-- : ,42n,,1) + 


The normal form of @ is called the M-sum of extended angles ®; (i = 1,...,/) and 
is denoted by 


1 
y @;, or equivalently by =D +m, 2 +m. +++ +m), Pr. 
Mi=1 


It is clear that M-sums of extended angles are defined in a unique way up to the 
set choice of the set M. 


Example 16.18. Let ® = Oz + larctan 1. Then 


@+_36=7-+larctanl, 
@+_.6=7-+larctan0, 
@+_,6=07-+larctan 1, 
@+o 6 = 07 + Ilarctan2, 
@+, 6 = 07 +Ilarctan 3. 


It is interesting to observe that the M-sum of extended angles is neither associa- 
tive nor commutative. 


Proposition 16.19. The M-sum of extended angles is nonassociative. 


Proof. For example, let 
®, = Ox +larctan 2, 
@, = 0a + larctan 3, 
®; = Oz + larctan5. 


Then 
@, +_, &) +_, & = 7+ larctan4, 
@, +_; (@. +_; ®3) = 2m + larctan0, 
(BH, +_; By) +_; B; = On + Ilarctan 1. 
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Proposition 16.20. The M-sum of extended angles is noncommutative. 
Proof. For example, let 

®, =Oxz-+larctanl! and ®,=O0z-+larctan - 


Then 3 B 
®, +; ®) = On + larctan a # Ox + larctan sap D+, @). 


Remark 16.21. The definition of M-sums of extended angles is canonically lifted to 
the case of classes of proper integer congruences of extended angles. 


16.1.6 Sums of integer angles 


We conclude this section with the definition of M-sums of integer angles. 


Definition 16.22. Consider integer angles a, where i= 1,...,/ for some / > 2. Let 
@; be the corresponding extended angles for o;, and let M = (mj,...,mj_) be an 
arbitrary (/—1)-tuple of integers. The integer angle @ associated with the extended 
angle 

p= ay +m a2 +n, wees +i, Qi. 


is called the M-sum of integer angles a; (i= 1,...,/) and denoted by 


l 
1D @;, orequivalently by 1 +m, 2 +m +++ +i, O- 
Mi=1 


Remark 16.23. The definition of M-sums of integer angles is also canonically lifted 
to the case of classes of proper integer congruences of integer angles. 


16.2 Relations between extended and integer angles 


Recall that |r| denotes the maximal integer not greater than r. 


Theorem 16.24. Consider an extended angle ® = Z(V,AgA,...An). Suppose that 
the normal form for ® is kt+@ for some pair (k,@). Let (ao,a1,.--,@2n—2) be the 
LSLS sequence for the integer oriented broken line AjA ... An. Suppose that 

P 


[ao34y 3 +++ 2 Agn-2] = —. 
q 


Then the following hold: 
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larctan 1, if p/q=-, 
larctan a if p/q>], 
I 
ox) Branton = f<P/a<1, 
0, ; if p/q =, 
m — larctan o—Tg-yemia f —1<P/a<0, 
m — larctan(—f ; if p/q<—1. 


Proof. Without loss of generality we assume that V is the origin O, Ag = (1,0), and 
1 
Ag + — sgn(AgO’A1)AoA, = (1,1). 
ao 


(One can get this after a certain integer affine transformation of the plane.) 
By Theorem 3.1 the coordinates of the point A, are (q,p). This directly implies 
the statement of the theorem for the cases p>q>0, p/q = 0, and p/q = ~. 
Suppose now that g>p>0. Consider the integer angle @ = ZAoPA,. Let Bo...Bm 
be the sail for @. Direct calculation shows that the point 


coincides with the point (1+|(q¢—1)/p|,1). Consider the proper integer linear 
transformation T satisfying the following two conditions: first, T takes the point 
Ao = Bo to itself, and second, T takes the point D to (1,1)’. These two conditions 


uniquely identify T as 
avs ( -L(g viel) 


The transformation T takes the point A, = Bm, with coordinates (gq, p), to the point 
with coordinates (q—|(p—1)/p|p,p)’. Therefore, 


P 
@ = larctan ——_———_____ 
q-\(q-1)/P|p 

The proof for the case p > 0 and q < 0 repeats the proof described above after 
switching to the corresponding adjacent angles. 

Finally, the case with p < 0 is centrally symmetric with respect to all the cases 
considered above. This concludes the proof of Theorem 16.24. 


16.3 Proof of Theorem 6.9(i) 


Now we return to the proof of the first statement of the theorem on sums of integer 
tangents for integer angles in integer triangles: 
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Yee 


Fig. 16.2 For the given triangle AABC we have If; (AB;C) =5. 


Theorem 6.9(i). On sums of integer tangents of angles in integer triangles. 
Let (0, Oz, 03) be an ordered triple of angles. There exists a triangle with consecu- 
tive angles integer congruent to 0, A, and Qs if and only if there exists i € {1,2,3} 
such that the angles a = a, B = Qi41(mod3)> Y = O+2(mod3) Satisfy the following 
conditions: 
(a) for € =]ltana, —1,lItan B[ the following holds & <0, or € > Itana, or € =9; 
(b) ]ltana,—1,ltanB, —1,ltan y[= 0. 


16.3.1 Two preliminary lemmas 


We begin the proof of Theorem 6.9(i) with two preliminary lemmas. 


Definition 16.25. Let AABC be an integer triangle. Consider all integer points in 
the interior of AABC and on its sides lying at unit integer distance from the side AB. 
We denote their number by 1¢;(AB;C) (see Fig. 16.2). 


Recall that all integer points at unit integer distance from AB described in the 
definition are contained in one straight line parallel to AB. Besides that, 


0 <1¢;(AB;C) < 1¢(AB). 
Now we prove the following lemma. 
Lemma 16.26. For every integer triangle AABC the following holds: 
ZCAB +1¢(4B)—10,(4B:Cc)-1 ZABC +e(Bc)—10 (Bc:A)-1 ZBCA = 1 
(where by @ we denote the extended angle corresponding to @). 


Proof. Consider an arbitrary integer triangle AABC. Suppose that the pair of vectors 
(BA, BC) defines the positive orientation of the plane (otherwise, we consider the 
triangle AACB). Set 


D=A+CB and E=A+CA 
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(see Fig. 16.4 on page 210). 
Since CADB is a parallelogram, we have 


ABAD& AABC, 


and hence 
ZBAD=ZABC and 1¢;(BA;D) =1¢;(AB;C). 


Since EABD is a parallelogram, we get 


AED&ABAD& AABC. 


Therefore, 
ZDAE&ZBCA and 1¢;(DA;E) =1¢;(BC;A). 


Let Ao ...An be the sail of the angle ZCAB with the corresponding LLS sequence 
(ao, Arar ,2n-2). 


Further, let BoB, ...B), be the sail of ZBAD (where Bo = A,) with the corresponding 
LLS sequence 
(bo, ..-,Dom—2)- 


Finally, let CoC; ...C; be the sail of ZDAE (where Co = B,,) with the corresponding 
LLS sequence 
(co, uae ,C21-2)- 
Consider now the A-broken line 
Ag..-AynB,B2...BmCiCo...Cy. 
The LSLS sequence for this broken line is 
(ao, ee ,A2n—2,t, bo, RAS; b2m~2, u,co, Ges ,C21-2), 


where ¢ and u are certain integers. By the definition of the M-sum of extended angles 
this sequence defines the extended angle 


ZCAB +, ZBAD +, ZDAE, 


which is equal to 7 by construction. 


Now let us compute t. Denote by A/, the integer point in the segment A,_ 1A, that 
is next to A,. Consider the set of integer points at unit integer distance from AB and 
lying in the half-plane with AB at the boundary and containing the point D. This set 
coincides with the following set (see an example in Fig. 16.3): 


{Ane =An +AjAn +kAA,|k € Z}. 
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/ 


“An,-1=A+4), An Ff An=Bo_ 


Fig. 16.3 Integer points A, ;. 


Denote by 7 the open half-plane bounded by the straight line AC and containing 
the point B. Let also z_ denote the complement of 77, in the plane. Since 


An,-2 =A+A)A, 


the integer points A, , for k less than or equal to —2 are in the closed half-plane 7_. 
Since 
An,-1 =A+A)An, 


the integer points A, , for k greater than or equal to —1 are in the open half-plane 
TT. 

The intersection of the parallelogram AE DB and the open half-plane 7, contains 
exactly 1¢(AB) points of the described set: these points are the points A, with —1 < 
k < 1¢(AB)-—2. 

Since 

ABAD& AABC, 


the number of points A,, in the closed triangle ABAD is 1;(AB;C): these points 
are exactly the points A, x with k satisfying 
1¢(AB) —1¢,(AB;C) —1 <k <1€(AB) —2. 
Denote by ko the integer 
1¢(AB)—1¢,(AB;C)-1. 


The point A, ;, 1s contained in the segment BoB, of the sail of the integer angle 
ZBAD (see Fig. 16.4). Since 
ZBAD& ZABC, 


we have 
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Fig. 16.4 The point Anko: 


t = sgn(An_1A4A,) sgn(A,AB)) sgn(An—1A,B1) sin ZAy_1A,Bi 
= 1-1-sgn(An—1AnAn jk) sin ZAn—1AnAn,ko 
= sign(ko)|ko| = ko = 1¢(AB) _ 1e, (AB;C) —1. 
For the same reason, 


u = 1é(DA) —1é,(DA;E) — 1 = 1¢(BC) —1;(BC;A) — 1. 


Therefore, 


ZCAB +1¢(4B)—10, (AB:C)—1 ZABC +1¢(BC)—10; (Bc:A)—1 BCA = 1. 


The proof is complete. 


Lemma 16.27. Let a, B, and y be nonzero integer angles. Suppose that 
a +u B +y Y =T. 


Then there exists a triangle with three consecutive integer angles proper integer 
congruent to a, B, and Y respectively. 


Proof. Let 
O = (0,0), A=(1,0), and D=(-1,0). 


Choose the integer points 
B=(qi,p1) and C= (q2,p2) 


with integers p;, p2 and positive integers qg1, gz such that 


ZAOB = larctan(Itana) and ZAOC = ZAOB+, 8. 


From the construction, the pair of vectors (OB, OC) defines the positive orientation, 
and ZBOC 2B. Since 
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O@+yB+yvy=a and @+,B=ZAOC, 


we have ZCOD&y. 
Define 
B'=(qip2,pip2) and C = (q2p1,pip2) 


and consider the triangle AB’OC’. The coincidence of angles 
ZB'OC' = ZBOC 
implies 
ZB'0OC' 2B. 


Since B # 0, the points B’ and C’ are distinct and the straight line B’C’ does not 
coincide with the straight line OA. Since the second coordinates of both points B’ 
and C’ equal q1q2, the straight line B’C’ is parallel to the straight line OA. Therefore, 
by Proposition 5.23 we have 


ZC B'O= ZAOB' = ZAOB=a and ZOC'B' = ZC'OD = ZCOD = yj. 


So, the consecutive integer angles of the triangle AB’OC’ are integer congruent 
to a, B, and y. 


16.3.2 Conclusion of the proof of Theorem 6.9(i) 


Let a, B, and y be nonzero integer angles satisfying the conditions (a) and (b) of 
Theorem 6.9(i). From the second condition 


]Itan(a), —1,Itan(B), —1,ltan(y) [= 0, 
it follows that _ 
@+14B+i17=ka. 


Since all three tangents are positive, we have either k = 1 or k = 2. 
By the first condition, 


|ltan a, —1,ltan B[< 0 or _—_ | iltana@,—1,ltanB[> Itana. 


Hence _ 

@+_1B =07+4@, 
for some integer angle @, and hence k < |. Together with the above, this implies 
that k = 1. Therefore, by Lemma 16.27 there exists a triangle with three consecutive 
integer angles integer congruent to a, B, and y, respectively. 


Let us prove the converse statement. First we prove condition (b) of Theo- 
rem 6.9(i). We do it by reductio ad absurdum. Suppose that there exists a triangle 
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AABC with consecutive integer angles 
a = ZCAB, B=ZABC, and y=ZBCA, 
such that 
|ltan a, —1,ltan B, —1,ltan y[ 4 0, 


|ltan B, —1,ltan y, —1,ltana[ 4 0, 
|ltan y, —1,ltana, —1,ltanB[ + 0. 


This system of inequalities and Lemma 16.26 imply that at least two of the integers 
1¢(AB)—1€;(AB;C)—1, 1€(BC)—1é,;(BC;A)—1, and 1¢(CA)—1¢;(CA;B)—1 


are nonnegative. 
Without loss of generality we suppose that 


1¢(AB) —1¢;(AB;C) — 1 > 0, 
1é(BC) —10,(BC;A) —1 > 0. 


On the one hand, since all integers of the continued fraction 
r =| ltan(a),1¢(AB) —1¢;(AB;C) — 1,ltan(B),1é(BC) — 14; (BC; A) — 1, Itan(y)| 


are nonnegative and the last one is positive, either r > 0 or r = oe. On the other hand, 
by Lemma 16.26 and by Theorem 16.24, we have that r = 0/—1 = 0. We arrive at a 
contradiction. Hence the second condition always holds. 


Now we prove that condition (a) holds. Consider a triangle AABC with consec- 
utive integer angles 


a = ZCAB, B=ZABC, and y=ZBCA. 
We have proven the second condition, so without loss of generality we assume that 
|Itan(a), —1,Itan(B),—1,ltan(y) [= 0. 


Since _ 

a+_1B +1V= 1, 
we have _ 

@+_18 =0x+@ 


for some integer angle @. The last expression for @ implies the first condition of the 
theorem. 
This concludes the proof of Theorem 6.9(i). 
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16.4 Exercises 


Exercise 16.1. Find the revolution number of the O-broken line with vertices 
Ao = (1,0), A,(—1,2), Ao = (0,—1), A3 = (1,1), Aq = (1,—2). 


Find the normal form of the extended angle 7(O0,A9A1A2A3A,). 


Exercise 16.2. Let AgA;...A, be a closed V-broken line. Suppose that it is con- 
tractible in R? \ {V}. Prove that for every ray r with vertex at V we have 


#(r,AoA, ...An) =0. 


Exercise 16.3. Find a proof of Lemma 16.11 (ii). 


Exercise 16.4. (a). Prove that every extended angle possesses a unique opposite 
extended angle. 

(b). Prove that the M-sum is uniquely defined by the ordered sequence of extended 
angles and the set of integers M. 


Exercise 16.5. Calculate the normal forms for the following angles 
(2a +larctan1) +_, (3a +larctan 3); 


(4x + larctan 3) +o (3% +larctan 3); 
(—5a + larctan 119) +2 (37 4+ larctan 3). 


Exercise 16.6. Find an example of a triple of integer angles that are not the angles 
of an integer triangle. 


Exercise 16.7. Is it true that for an arbitrary pair of integer angles (a, 8) there exists 
an integer angle y such that these three angles are angles of some integer triangle? 


| ® 
Check for 
updates 


Chapter 17 


Integer Angles of Polygons and Global Relations 
for Toric Singularities 


In Chapter 6 we proved a necessary and sufficient criterion for a triple of inte- 
ger angles to be the angles of some integer triangle. In this chapter we prove the 
analogous statement for the integer angles of convex polygons. Further, we discuss 
an application of these two statements to the theory of complex projective toric 
surfaces. We refer the reader to the general theory of toric surfaces in the works 
of V.I. Danilov [49], G. Ewald [61], W. Fulton [66], T. Oda [162], and A. Tre- 
visan [214]. In this book we do not consider the multidimensional case (we refer the 
interested reader to the paper of H. Tsuchihashi [215)). 

In Section 17.1 we formulate and prove a theorem on integer angles of convex 
polygons. After a brief introduction of the main notions and definitions of complex 
projective toric surfaces (Section 17.2) we discuss two problems related to singular 
points of toric varieties using integer geometry techniques. 


17.1 Theorem on angles of integer convex polygons 


We start with a theorem on necessary and sufficient conditions for integer angles to 
be the angles of some convex integer polygon. 


Theorem 17.1. Let (Q1,...,Q,) be an arbitrary ordered n-tuple of nonzero integer 
angles. Then the following two conditions are equivalent: 

(i) there exists a convex n-gon with consecutive integer angles 0),..., Qn; 

(ii) there exists a sequence of integers (m,,...,M,—1) such that 


T— Qj +m, ates +iny- T—A, = 20. 


Proof. Condition (i) = Condition (ii). Suppose that there exists an integer convex 
polygon A,A2...A, with prescribed consecutive integer angles Q1,..., Qn. We as- 
sume that the pair of vectors (A2A3,A2A1) defines the positive orientation of the 
plane (otherwise, we consider the polygon A,A,_1...A;). Let 
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By =O+A,A1 and Bj =O+A,_-1A; fori=2,...,n. 
Define 


p= ZB ,OBj41, ifi=1,...,n—-1, 
‘| ZB,OB,, ifi=n. 


Consider the broken line that is the union of all the consecutive sails for angles 
Bi,..., Bn. This broken line defines an extended angle proper integer congruent to 
22a+0. The LSLS sequence for this broken line contains exactly n—1 new elements 
(together with all the elements of the LLS sequences for the sails of the angles 
Bi,..-, Bn). Denoting these numbers by m1,...,/7—1, we get 


Bi +m, Tiny} Bn = 20. 
From the definition of B; (i = 1,...,n) it follows that 8B; 22—@a;. Hence 
M—Qj tiny °° +m, E—- On = 20. 


Therefore, the first condition implies the second condition. 


Condition (ii) = Condition (i). We assume that there exists M = (m,...,17p—1) 
such that 
TQ; +m at +i 1 T—On = 20. 


This equation implies the existence of integer points B, = (1,0), B; = (xi,y;), for 
i=2,...,n—1, and B, = (—1,0) such that 


B,;OB;_,=n—o;_),fori=2,...,n, and ZB,OB,2{2—Op. 


Denote by P the integer point 


n 
O+) OB. 
i=l 
Since a; 4 0, the angle 7—@; is not contained in a line (i = 1,...,m). Hence, the 
origin O is an interior point of the convex hull of the points B; for i= 1,...,k. 


Therefore, there exists an integer s such that the integer triangle AB,PB,,) contains 
the origin O and in addition, O is not in the edge B,B,,; (here B,., = B, and 
Bo =B,). This implies that 


O =A, OP +A20B;+A30B;41, 


where A; > 0, Az > 0, and Az > 0 are rational numbers. So there exist positive 
integers c; (i = 1,...,n) such that 


¥ (ci0Bi) =0. (17.1) 


Consider Ag = O and A; = Aj_| +c;OB; for i=2,...,n. Since the numbers c),...,C; 
are integers, the broken line AgA,...A, is integer. From equation (17.1) the broken 
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[| [TN 


Fig. 17.1 Two integer noncongruent polygons with proper integer congruent integer angles. 


line is closed (i.e., Ag = Ay). By construction, the integer angle at A; of this broken 
line is proper integer congruent to a (i= 1,...,n). Since c; > 0 fori=1,...,n and 
since all the vectors OB; are in counterclockwise order, the broken line is a convex 
polygon. Therefore, condition (ii) holds. 


Remark 17.2. On the one hand, Theorem 17.1 is an extension of Theorem 6.9(i). 
On the other hand, the direct generalization of Theorem 6.9(ii) is false: the integer 
angles do not uniquely determine the proper integer affine homothety types of integer 
convex polygons. See an example in Fig. 17.1. 


A completely satisfactory description of integer congruence classes of lattice 
convex polygons has not yet been found. It is known only that the number of convex 
polygons with lattice area bounded from above by n grows exponentially in n'/3 as 
n tends to infinity (see [8] and [19]). 


17.2 Toric surfaces and their singularities 


17.2.1 Definition of toric surfaces 


We start with the definition of complex projective toric surfaces associated to integer 
convex polygons. 


Definition 17.3. Consider an integer convex polygon P = AgA,...A,. Let the in- 
tersection of this (closed) polygon with the integer lattice Z* consist of the points 


B; = (x;,y;) for i=0,...,m. We enumerate the points in such a way that B; = A; for 
i=0,...,n. Denote by Q the following set in the complex projective space CP”: 
XO 40 4-XO0-YO «AL AVL g—MIAV1 ws Xm png —Xm—Y, 
{ (#1°3%*5 Diag © aed ots e CK {o}}. 


The closure of the set Q in the natural topology of CP” is called the complex pro- 
jective toric surface associated with the polygon P and denoted by Xp. 


Example 17.4. For instance, consider the following two polygons: 


P= NS and Q= La 


Then Xp = CP”, and Xg is the conic in CP? defined by the equation 
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X1X3 = X0X4. 


Notice that the integer area coincides with the degree of the resulting surface. This 
is a general fact of toric geometry. 


Remark 17.5. One can generalize the described construction to the case of poly- 
hedra of arbitrary dimensions. Notice that in the multidimensional case this con- 
struction may have nonisolated singular points. In this situation there is a standard 
procedure to resolve them (via certain ¢-processes). The resulting variety is the toric 
variety corresponding to the polyhedron. 


Remark 17.6. One of the main properties of toric surfaces (and toric varieties in 
general) is as follows. Every toric surface admits a huge group of isomorphisms. 
While here we discuss only toric surfaces, similar statements hold for toric varieties 
of arbitrary dimension. Consider a toric surface Xp, parameterized as Xp(to,t,t2). 
Consider the multiplicative group of complex numbers C* = C \ {0}. Every toric 
surface Xp admits a natural action of the group C* x C*. An arbitrary element (a,b) 
of the group C* x C* acts on the toric surface Xp as follows: 


Sa,p) : Xp — Xp, Fab) (XP(to,t,t2)) = Xp(ato, bt ,t2). 


We use this action below to find singular points of toric surfaces. 


17.2.2 Singularities of toric surfaces 


We begin with the following definition. 


Definition 17.7. Algebraic singularities of complex projective toric surfaces are 
called toric singularities. 


Let us first collect general properties of complex projective toric surfaces. 
For i=0,...,m, set 


Aj = (0: +--:0:1:0:-+-:0), 


where | stands in the ith place. 


Theorem 17.8. (i) The set Xp is a complex projective complex-two-dimensional 
surface with isolated algebraic singularities; 

(ii) the complex projective toric surface contains the points A; for i=0,...,n (where 
n+1 is the number of vertices of the convex polygon); 

(iii) the points of Xp \ {Ao,A1,.-.;An} are nonsingular; 

(iv) the point A; is singular if and only if a; % larctan1, where ; is the angle of the 
polygon P at vertex Aj; 

(v) the algebraic singularity at A; (0 < i < n) is uniquely determined by the integer 
affine type of the nonoriented sail of Qj. 
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Example 17.9. Consider the polygon 
P = 


and the corresponding toric surface Xp. The surface Xp C CP? is defined by the 
equation 
Xox 1x2 = coe 


Its singularities are the points 
(1:0:0:0), (0:1:0:0), and (0:0:1:0). 


In the appropriate affine charts these three singularities are defined by the equation 


wae. 

For the proof of Theorem 17.8 we refer to classical textbooks on toric geometry 
((49], [61], [66], [162], etc.). Nevertheless, we would like to outline some ideas 
and remarks related to the proof. The first item is a classical statement of algebraic 
geometry. 


Outline of the proof of item (ii). Consider an arbitrary vertex A; of the polygon P. 
Let J; be some strictly supporting line for the polygon at A; (i.e., J; P = {A;}). 
Consider a vector v orthogonal to /; and directed away from the polygon. Assume 
that v = (A,,A2). Consider a curve in RP” C CP” with real parameter f: 


| (orton » pxit4ay se. phn ayn) Ite R}. 


It is clear that this curve is a subset of Xp. By construction of v, the largest value of 
the exponent will be at the place i corresponding to A;. Thus the limit point of this 
curve is exactly the point A; as f tends to infinity. Therefore, the point A; belongs to 
the toric surface Xp. For a similar reason, the points An+1, Ae Am are not in Xp. 


Outline of the proof of item (iii). To detect all singular points of Xp we use the action 
of the group C* x C* introduced in Remark 17.6. Notice that the group takes sin- 
gular points to singular points and nonsingular points to nonsingular points. There 
are two-dimensional, one-dimensional, and zero-dimensional orbits of the action of 
C* x C* on Xp. Since all the singular points of Xp are isolated, they are all contained 
in the zero-dimensional orbits, i.e., at points where C* x C* acts trivially. It can be 
only the points with a single nonzero coordinate 


(O:---:0:1:0:---:0). 


The toric surface Xp contains only singular points with ones at places 0,...,n. That 
is exactly the points Ao,...,An. 
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Remarks on items (iv) and (v). As stated in Theorem 17.8(v), singularities of toric 
surfaces are in one-to-one correspondence with integer affine types of nonoriented 
sails of integer angles. Therefore, toric singularities are classified by the LLS se- 
quences of corresponding integer angles up to switching the order of the sequence. 
We call the corresponding LLS sequence the continued fractions of the toric singu- 
larity. Therefore, the pair of rational numbers 


(Itan a, Itan a) 


for an arbitrary integer angle @ is a complete invariant of the corresponding toric 
singularities, which we call the sail pair of the singularity. 

The continued fractions for toric singularities are slightly different from the 
Hirzebruch—Jung continued fractions for toric singularities (see [93] and [88]). 
The latter ones have all negative integer elements except for the first one, which 
is a positive integer. It is interesting to note that Hirzebruch—Jung continued frac- 
tions generate a sequence of o-processes that resolve the corresponding singularity. 
The relations between regular continued fractions and Hirzebruch—Jung continued 
fractions are described in [182]. 


17.3 Relations on toric singularities of surfaces 


In this subsection we study global conditions on complex projective toric surface 
singularities associated to n-gons (in algebraic language, n is understood as the Eu- 
ler characteristic of the corresponding toric surface; see, for instance, [214]). In 
Section 17.3.1 we fix an arbitrary n and study whether a certain n-tuple of toric 
singularities is realizable as a set of singular points of a toric surface defined by an 
n-gon. Further, in Section 17.3.2, we fix an arbitrary m-tuple of toric singularities 
and construct an m-gon (with large enough n) whose singularities are exactly the 
singularities of the m-tuple. 


17.3.1 Toric singularities of n-gons with fixed parameter n 


We begin this subsection with triangles. The corresponding toric surfaces have at 
most three toric singularities. The relations on these singularities are described by a 
reformulation of Theorem 6.9(i). 


Corollary 17.10. Consider three arbitrary complex-two-dimensional toric singu- 
larities with sail pairs (a;,b;), where i = 1,2,3. Then the following two statements 
are equivalent: 


(i) There exists a complex projective toric surface of Euler characteristic equal to 3 
whose singular points have sail pairs (a;,b;) for i = 1,2,3. 
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(ii) There exist a permutation o € S3 and rational numbers c; € {a;,b;} for i= 1,2,3 
such that the following conditions hold: 

— the continued fraction Ieo1),—1,¢6(2)| is either negative, greater than Cg(1), or 
equal to ~; 

— ]eg(1);—1,¢6(2),-1,¢6(3) [= 0. 


Remark. Let us say one more time that one should use only odd continued fractions 
for c), C2, and c3 in the statement of the above proposition. 


When n > 3 we have the following weaker statement. It follows directly from 
Theorem 17.1. 


Corollary 17.11. Consider n arbitrary complex-two-dimensional toric singularities 
with sail pairs (a;,b;), where i= 1,...,n. Then (i) => (ii), where 


(i) There exists a complex projective toric surface of Euler characteristic n whose 


singular points have sail pairs (a;,b;), fori =1,...,n. 

(ii) There exist rational numbers c; € {a;,b;} for i= 1,...,n and a sequence of 

integers (m,,...,M,—1) such that the following condition holds: 
Je1,71,€2,™m2,.--,Mn—1,Cn[= 0. 


The following problem is still open. 


Problem 11. Find a necessary and sufficient condition for existence of a complex 
projective toric surface of Euler characteristic n in terms of sail pairs of its singular- 
ities and a sequence of integers M (as in Corollary 17.11). 


Notice that all integers of M are greater than —2. In addition, at least one of the 
elements of M is equal to —1. 


17.3.2 Realizability of a prescribed set of toric singularities 


Finally, we discuss the realizability of toric surfaces with a prescribed set of toric 
singularities. 


Proposition 17.12. For every collection (with multiplicities) of complex-two-dimen- 
sional toric algebraic singularities there exists a complex projective toric surface 
with this collection of singularities. 


In the proof of Proposition 17.12 we use the following lemma. 


Lemma 17.13. For every collection of integer angles a; (i =1,...,n) there exist an 
integer k > n—1 and a k-tuple of integers M = (mj,...,m) such that 


+m +++ +m, _, On +m, larctan 1 Figg 1 Fig larctanl = 27. 
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Proof. Consider any collection of integer angles a; (i = 1,...,n) and set 
DP =O +1 OH] +1-+- +1 Mh. 


It is clear that one of the LSLS sequences for ® is obtained by adding the LLS 
sequences of the angles a; and the number | taken n—1 times. All the elements of 
this LSLS sequence are positive integers, and hence @ is of the form @ + Oz for 
some integer angle 9. 

If @ = larctan 1, then we have 


@ +_» larctan 1 +_2 larctan 1 +_2 larctan1 = 27. 


Then k =n+2, and M = (1,...,1,—2,—2,—2). 
Suppose now that g $ larctan 1. Then the following holds: 


@+_1 7—@ +_2 arctan 1 +_2 larctan] = 27. 


Consider the sail for the angle 7—@. Suppose that the sequence of all its consecutive 
integer points (not only vertices) is (Bo,...,Bs), where the order coincides with the 
order of the sail. Then we have 


ZB,OB;.;=larctanl for everyi=1,...,5. 
Put b; = |sin 7B;OB;, for i= 1,...,s. We get 
@ +~_2 larctan 1 +_2 larctan 1 +_2 larctan 1 
= My +1 Oy +1 °°°+1 Gy 


+_, larctan1 +,, larctan 1 +), «+++, larctan 1 
+_, larctan 1 +_, larctan 1 +_2 larctanl = 27. 


Therefore, k = n+s+3, and 
M =(1,1,...,1,1,-1,b1,...,bs,-2,-2, —2). 
a 


(n—1) times 


The proof of Lemma 17.13 is complete. 


Proof of Proposition 17.12. Consider an arbitrary collection of surface toric al- 
gebraic singularities. Suppose that they are enumerated by integer angles a; (i = 
1,...,2). By Lemma 17.13 there exist k > n—1 and M = (mj,...,m,) such that 


(% — Oy) +imy + Fry (% — On) +m, larctan 1 +m,,, °° * +m, larctan 1 = 27. 


Then by Theorem 17.1 there exists a convex polygon P = Ag...Ax with integer 
angles proper integer congruent to the angles a; (i = 1,...,n), and the last k—n+1 
angles equal larctan 1. 
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By Theorem 17.8(c) all singularities of the toric surface Xp are contained in the 
set {Ao,A1,...,Ax}. By Theorem 17.8(d) the points A; corresponding to larctan 1 
are also nonsingular. The collection of toric singularities at the remaining points 
coincides with the given collection. This concludes the proof of Proposition 17.12. 


Example 17.14. Let us construct a projective toric surface having a unique toric 
singularity with the sail pair (7/5,7/3). Consider @ 2 larctan(7/5). First of all, we 
draw the angle 7—a and its adjacent angle s—(a—«a) (which is a). Further, we 
subdivide the half-plane in the complement to the union of @ and 7—q in a stan- 
dard way into two angles B and y (such that the resulting LSLS sequence ends in 
(—2,1,-2,1)). 


Secondly, we subdivide the angle ~@ = 1 — (1—«) into angles integer congruent to 
larctan 1, shown here: 


Finally, we construct the hexagon P all of whose angles are adjacent to the angles 
of the obtained decomposition. 


The toric surface Xp has a unique singular point; its sail pair is (7/5,7/3). 


17.4 Exercises 


Exercise 17.1. Consider an arbitrary toric surface Xp. Characterize all points of Xp 
where the action of C* x C* on Xp is (a) constant; (b) one-dimensional; (c) 
two-dimensional. 


Exercise 17.2. Write the toric surfaces for the following polygons 
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r-PY o- PR oa 


Enumerate all their singularities and find the sail pairs of the corresponding singu- 
larities. 


Exercise 17.3. Find a toric surface with a unique toric singularity whose sail pair is 
(5/3,5/2). 

Exercise 17.4. Prove that the sequence M in Corollary 17.11 has at least one nega- 
tive integer. Show that all negative numbers are equal to — 1. 


Exercise 17.5. Find an example of an integer quadrangle whose sequence M has 
exactly one negative integer. 


Part II 
Multidimensional Continued Fractions 
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In this second part we study multidimensional versions of several classical the- 
orems on continued fractions. For this purpose we use multidimensional continued 
fractions in sense of Klein—Voronoi. The material of this part is relatively recent, so 
we accompany it with related open problems and conjectures. In addition, we pro- 
vide links to classical problems and conjectures, such as the Littlewood conjecture, 
the lonely runner problem, White’s problem, etc. In conclusion, we briefly discuss 
other generalizations of continued fractions. 


As in the planar case, we begin with a discussion of basic features of multidi- 
mensional integer geometry. In Chapter 18 we introduce the main integer objects 
and present the corresponding invariants. Further, in Chapter 19 we study a more 
specific question related to emptiness of integer simplices, pyramids, and tetrahe- 
dra. In particular, we formulate and proof White’s theorem on three-dimensional 
empty integer triangles. 


In Chapter 20 we introduce multidimensional continued fractions in the sense 
of Klein, which are special polyhedral surfaces related to simplicial cones in real 
spaces. We study both finite and infinite Klein’s continued fractions. We prove that 
for a generic cone, the corresponding continued fraction has a good polyhedral 
structure (using the multidimensional Kronecker’s approximation theorem). Finally, 
we study the structure of two-dimensional compact faces of continued fractions in 
the sense of Klein. 


In the next two chapters, we investigate the periodic structure of algebraic mul- 
tidimensional continued fractions. The structure of such continued fractions is pe- 
riodic due to the action of the Dirichlet groups. We study some classical aspects 
concerning Dirichlet groups in Chapter 21. In particular, we describe several techni- 
cal details related to the calculation of the bases of those groups, and the questions 
of lattice reduction (e.g., the simplest LLL-algorithm). Further, in Chapter 22 we 
discuss periodic multidimensional continued fractions. Finally, we prove a gener- 
alized version of Lagrange’s theorem and explain the relation of multidimensional 
continued fractions to the Oppenheim conjecture. 


We discuss the multidimensional Gauss—Kuzmin statistics in Chapter 23. Firstly, 
we extend the Mébius measure of the projective plane to higher dimensions. Sec- 
ondly, we formulate general results regarding the generalized Gauss—Kuzmin 
statistics on relative frequencies of polygonal faces in continued fractions in the 
sense of Klein. Finally, we give examples of explicit calculation for certain integer 
types of faces. We conclude with several open problems on the distribution of faces 
in multidimensional continued fractions. 


In Chapter 24 we bring together several techniques to construct multidimensional 
continued fractions. There are two different algorithmic approaches to this issue. 
The first approach deals with a face-by-face construction of the Klein’s polyhedral 
continued fraction. In the second approach one makes an approximation of a sail 
and conjectures which faces of the approximation are the faces of the continued 
fraction. We discuss the corresponding algorithm and study one particular example. 
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Chapter 25 is dedicated to multidimensional Gauss’s reduction theory. We show 
that the Hessenberg matrices play the role of reduced matrices. Further we discuss 
a complete geometric invariant of conjugacy classes of integer invertible matrices 
in the simplest case that all eigenvalues are distinct. If a matrix has a real spectrum, 
then the corresponding invariant is its Klein’s continued fraction. For the remaining 
cases we use multidimensional continued fractions in the sense of Klein—Voronoi 
as invariants, which naturally extends Klein’s continued fractions to the case of ma- 
trices with arbitrary spectra. After that we present the technique of multidimensional 
Gauss reduction. Our next goal is to show how to solve Diophantine equations re- 
lated to Markov—Davenport characteristics, the main ingredient here is algebraic 
periodicity induced by the Dirichlet group. We conclude this chapter with an ex- 
haustive study of the new case of the family of three-dimensional operators having 
two complex conjugate eigenvalues. This part of material has never been published 
before. 


We investigate questions concerning best approximation of multidimensional 
simplicial cones in Chapter 26. In particular we discuss the relation to the maximal 
commutative subgroup approximation. Further we show that simultaneous approx- 
imation can be also considered in the context of maximal commutative subgroup 
approximation. 


Finally, in Chapter 27 we expose a collection of different generalizations of mul- 
tidimensional continued fractions mentioning interesting theorems related to them. 
We do not pretend that this collection is complete. The aim of this chapter is to show 
the diversity of methods that are used to study the extensions of classical problems 
of continued fraction theory. 


| ® 
Check for 
updates 


Chapter 18 


Basic Notions and Definitions of 
Multidimensional Integer Geometry 


In this chapter we generalize integer two-dimensional notions and definitions of 
Chapter 2. As in the planar case, our approach is based on the study of integer in- 
variants. Furthe,r we use them to study the properties of multidimensional continued 
fractions. First, we introduce integer volumes of polytopes, distances, and angles. 
Then we express volumes of polytopes, integer distances, and integer angles in terms 
of integer volumes of simplices. Finally, we show how to compute integer volumes 
of simplices via certain Pliicker coordinates in the Grassmann algebra (this formula 
is extremely useful for the computation of multidimensional integer invariants of 
integer objects contained in integer planes). We continue this with a discussion of 
the Ehrhart polynomials, which one can consider a multidimensional generalization 
of Pick’s formula in the plane. This chapter is concluded with the classification of 
elementary integer-regular polygons. 


18.1 Basic integer invariants in integer geometry 


Let us expand integer geometry to the multidimensional case. The main object of 
integer geometry is the lattice Z” in R", ie., the lattice of vectors all of whose 
coordinates are integers. 


18.1.1 Objects and the congruence relation 


We say that a vector (point) is integer if all its coordinates are integers. A plane in 
R” is called integer if its integer vectors form a sublattice whose rank equals the 
dimension of the plane. We say that a polyhedron is integer if all its vertices are 
integer. A cone in R” is said to be integer if its vertex is integer and each of its edges 
contains integer points distinct from the vertex. 
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The integer congruence relation is defined as in the two-dimensional case by the 
group of affine transformations preserving the set of integer points, i.e., Aff(n,Z). 
This group is a semidirect product of GL(n,Z) and the group of translations on 
integer vectors. We use =” to indicate that two objects are integer congruent. 


18.1.2 Integer invariants and indices of sublattices 


As in the planar case described in Chapter 2, the integer invariants in the multidi- 
mensional case are easily defined in terms of indices of certain sublattices. We first 
give a geometric description of the index of sublattices in terms of the number of 
integer points in certain parallelepipeds. 


Proposition 18.1. The index of a sublattice generated by integer vectors V,,...,Vx in 
an integer k-dimensional plane equals the number of all integer points P satisfying 


n 
AP=) Ay; with =O<A, <1, i€{l,...,n}, 
i=1 


where A is an arbitrary integer point. 


Proof. The proof of this statement is similar to the proof of the planar one. Let H 
be a subgroup of Z? generated by v;,..., vg. Define 


Par(v1,..-,¥x) = {yaw 0<A; <1, i= davivnh, 
i=l 


First, we prove that for every integer vector g there exists an integer point P lying 
in the set Par(vi,...,v,%) such that AP € gH. Let 


n 
&= y Livi. 
i=l 


Then consider 


P=A+ Y (aL) 


Since 0 < uj;—| uM; | < 1 for i=1,...,n, the point P is inside the parallelogram and 
AP € gH. 


Second, we show the uniqueness of such a point P. Suppose that for two integer 
points P;,P) € Par(v1,...,vg) we have AP; € gH and AP) © gH. Hence the vector 
P| P, is an element in H. The only element of H of type 


av+Bw with 0<a,B <1 


is the zero vector. Hence P; = P. 
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Therefore, the integer points of Par(v1,...,v,) are in one-to-one correspondence 
with the cosets of H in Z?. 


18.1.3 Integer volume of simplices 


Let us start with the notion of integer volume for integer simplices. We will define 
the integer volumes for polyhedra in the next subsection. 


Definition 18.2. An integer volume of an integer simplex A;...A, is the index of 
the sublattice generated by the vectors A2A1,...,A,Aj, in the integer lattice of the 
space Span(A2Aq,...,4,A1). Denote it by 1V(A;...A,). 

For the points A; ...A, that are contained in some (n—2)-dimensional plane we say 
that I1V(A;...A,) =0. 


Example 18.3. Consider a three-dimensional simplex S C R* with vertices 
sy = (2,3,0,1), s2=(1,4,2,4), 53 =(1,0,0,4), sa =(1,0,0,1). 


The integer volume of S is 6. 


We postpone the calculation of the volume of simplices for a while (see Exam- 
ple 18.32 below). 


18.1.4 Integer angle between two planes 


First, we give the definition of an integer angle for two linear subspaces. 


Definition 18.4. Consider two integer linear spaces L; and Ly that are not con- 
tained one in another. The integer angle between L, and Ly is the index of the sub- 
lattice generated by all integer vectors of L; UL in the integer lattice of the space 
Span(L;, Lz). Denote it by la (Li, Lz). 

In case one space is a subspace of the other we agree to say that the integer angle 
between them is zero. 


In order to calculate integer angles via integer volumes one can use the following 
proposition. 


Proposition 18.5. Consider two integer linear spaces L, and Ly that are not con- 
tained one in another. Let the sets of independent integer vectors 


{uy,.--,Uk;W1,---;Wm}, {V1,---,V,W1,---,>Wmp, and {wy,...,Wm} 
form bases in Ly, Lz, and L, \Lz respectively. Then we have 


adit) = TV (1, - 6-5 Uks Vise +e VIsW1,--+;Wm) IV (W1,---,Wm) 


IV (uy,.-- Uk, W1,---;Wm)IV(V1,.--5¥1;Wi5---5Wm) 
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Proof. First, let us change the basis (w;) of Lj Lz to the basis (w;) that generates 
the integer sublattice in L} (Lz. The value of the formula stays unchanged, since 
the numerator and the denominator are both divided by IV? (w1,---,Wm)- 

Second, we extend the basis (w;) with integer vectors i; to the basis of (a;,W;) 
that generates the integer lattice of L;. The value of the formula stays unchanged, 
since the numerator and the denominator are both divided by the same quantity 
IV(u1,..-,Uz,W1,---,Wm)- 

Third, we extend the basis (w;) with integer vectors ¥; to the basis of (¥;,W;) that 
generates the integer lattice of Ly. For similar reasons the value of the formula stays 
unchanged. 

From the definition we have 


IV(q,... HK, V1,.- +, Ve, W1, +++, Wm) =la(L,L2), 


IV(i1,..-,U,W1,---,Wm) = 1, 
IV(¥1,.--,0x,W1,---,Wm) = 1, 
IV(W1,...,Wm) = 1. 


For these vectors the statement of the proposition is clearly holds. This concludes 
the proof. 


Finally, we give a definition of the integer angle between arbitrary (affine) planes. 


Definition 18.6. Consider two integer planes 7 and 7. Let L; and Lz be the spaces 
of vectors corresponding to 7) and 7. The integer angle between 7 and 7 is the 
index la(L,L2). We denote it by la(7, 7m). 


18.1.5 Integer distance between two disjoint planes 


Definition 18.7. Consider two disjoint integer planes 7, and 7m. Let L; and Lz be 
the spaces of vectors corresponding to 7 and 7, and let a = A,A2 be a vector such 
that A; € 7 and A2 € 7%. The integer distance between 7, and 7 is the index of the 
sublattice generated by all integer vectors of Lj UL2 U Span(a) in the integer lattice 
of the space Span(Zy,L2,a). Denote it by ld(7, 72). 

In the case of an intersecting subspace of another we agree to say that the integer 
distance between them is zero. 


Remark 18.8. One of the simplest useful invariants in three-dimensional integer 
geometry is the distance between an integer point p and an integer plane 7, i.e., 
the index of the lattice generated by the integer vectors joining p and all integer 
points of z in the whole integer lattice of Z*. Integer distance has a nice geometric 
description. Draw all integer planes parallel to 7. One of them contains the point p 
(let us call it z,). The integer distance ld(p,7) is the number of integer planes in 
the region bounded by the lines 7, and 7 plus one. Let us illustrate by the following 
example: 


18.2 Integer and Euclidean volumes of basis simplices 233 


There are three integer planes parallel to 7 and lying between 7, and 7. Hence, 
Id(p,a) =34+1=4. 
Let us present a formula relating integer distances and integer volumes. 


Proposition 18.9. Consider two disjoint integer planes ™ and 7. Let L, and L» 
be the spaces of vectors corresponding to 7% and %, and let a be a vector with 
one integer endpoint in 7m and one integer endpoint in %. Suppose that the sets of 
independent integer vectors 


{uy,.--,Ue,Wi,---;Wm}, {V1,---5VyW1y--->Wmt, and {wy,..-,Wm} 
form bases in Ly, Lz, and L, \Lz respectively. Then we have 


TV (4, 01,6665 UksViy +++ 5 V1,W1,---;Wm) LV (w1,.--;Wm) 


ld(7, 72) = ; 
(71,72) IV(u1,..-,Uk,W1,--->Wm)IV(11,---,¥7,W1,---;Wm) 


The proof of Proposition 18.9 is similar to the proof of Proposition 18.5, so we 
skip it here. 

In this section we have defined several invariants of planes and pairs of planes. 
In general, many other nice invariants for different arrangements of integer planes 
(for instance, for triples of integer lines whose span is 5-dimensional) are defined in 
a similar way. 


18.2 Integer and Euclidean volumes of basis simplices 


Here we study simplices of maximal dimension whose edges generate an integer 
lattice. We prove Proposition 18.11, which is analogues to Proposition 2.15. 


Definition 18.10. An integer polyhedron is called empty if it does not contain any 
integer points other than its vertices. 


Denote by V(AoA1 ...An) the Euclidean volume of a simplex AoA1 ...An. Recall 
that 
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1 
V(ApA1..-An) = | det(AoA1 ,AoAa,---A0An) 


We also denote by P(Ag;A1,...,An) the parallelepiped 
n 

{Ao +) AArAo | 0<A;< i. 
i=l 


Proposition 18.11. Consider an integer simplex AoA, ...Ay in R". Then the follow- 
ing statements are equivalent: 

(a) P(A03A1,.--,;An) is empty; 

(b) IV(ApA,..-An) = 1; 

(c) V(AoA1.--An) = 4. 


Remark 18.12. In Section 2 we showed that every empty triangle has integer area 
1 (and Euclidean area 1/2). This is no longer true in the multidimensional case; for 
instance, see Example 18.33. We describe the situation in the multidimensional case 
in the next chapter. 


Proof. (a) => (b). Let a parallelepiped P(Ag;Aj,...,An) be empty. Therefore, by 
Proposition 18.1 there is only one coset for the subgroup generated by vectors 
AjAo,-.--,;AnAo. Hence, the vectors A;Ao,...,AnAo generate the integer lattice, and 
IV(AoA]..-An) = 1. 


(b) = (c). Let 1V(AoA1...An) = 1. Hence the vectors A,Ao,...,A,Ao generate the 
integer lattice. Thus every integer point is an integer combination of them, so for the 
jth coordinate vector e; we have 


n 
ej= y ijAiAo, 
j=l 


with integers A;; for 1 <i, j <n. Denote by M; the matrix where at the jth place of 
the ith column is A; j- Let also Mz be the matrix whose ith column is filled with the 
coordinates of the vector AgA; for i= 1,...,n. Then 


M, -M2 = 1d, 


where Id is the identity matrix. Since these matrices are integer, both their determi- 
nants equal either 1 or —1. Hence the Euclidean volume of the simplex AgA1...An 
coincides with the Euclidean volume of the coordinate simplex and equals a. 


(c) = (a). Consider an integer simplex AgA1 ...A, of Euclidean volume a. Suppose 
that the corresponding parallelepiped P(Ao;A1,...,An) has an integer point distinct 
from its vertices. Then there exists an integer simplex such that one of its facets (of 
codimension 1) coincides with a facet of AgA;...An, and the height of this facet is 
smaller than the corresponding height in AgA;...A,. Hence there exists an integer 
simplex whose Euclidean volume is smaller than a which is impossible (since the 
determinant of 7 integer vectors is an integer). 
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Proposition 18.13. For an arbitrary simplex S of full dimension we have the fol- 
lowing formula: 


1V(S) =V(P(S)). 


Proof. Let us prove this statement by induction on the integer volume of simplices. 
Base of induction. The statement for simplices of integer volume | follows directly 
from Proposition 18.11. 


Step of induction. Let the statement hold for all simplices of integer volume less then 
N (N > 1). Let us prove it for an arbitrary simplex S = vov, ...v, of lattice volume 
N. 

Consider a polyhedron P(S). By Proposition 18.11, since 1V(S) > 1, there is 
an integer point in P(S) distinct from the vertices. Without loss of generality we 
suppose that this point is not on any facet parallel to v1,...,v,. Hence 


d(vo,V1---Vn) > 1. 


Denote this value by do. 
Consider an integer point Vo at unit integer distance to the plane of the facet 
v1 ...V,. By Proposition 18.9 we get 


IV(vov1--- Vn) 


=dp. 
IV(Vov1 dant Vn) 
It is clear that for Euclidean volumes we have the same relation: 


V(P(vov1---Vn)) 


V(P(Vov1-.-Vn)) 


Since do > 0, we have 1V(¥ov1 ... vn) < N, and hence we are in a position to apply 
the induction assumption: 


IV(vov1.-- Vn) = dolV(Vov1..- Vn) = doV (P(Vov1 -.-Vn)) = V(P(vov1---Vn))- 


This concludes the proof. 


18.3 Integer volumes of polyhedra 


In this subsection we write down an expression for the integer volume of simplices 
via certain Euclidean volumes. Further, we use the additivity of Euclidean volume 
to get a natural extension of lattice volume to the case of integer polyhedra. Finally, 
we discuss how to decompose an arbitrary convex integer polyhedron into integer 
empty simplices. 
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18.3.1 Interpretation of integer volumes of simplices via Euclidean 
volumes 


We begin with a definition of determinants of sublattices. 


Definition 18.14. Consider a sublattice L of the integer lattice Z”. The Euclidean 
volume of the simplex whose edges generate L is called the determinant of L and 
denoted by det L. 


Let us say a few words about the determinant being well defined. Recall that ZL 
can be mapped by a linear (not necessarily integer) mapping to Z‘ C R*. In Propo- 
sition 18.11 we showed that all simplices whose edges generate Z‘ have the same 
Euclidean volume. Hence all simplices whose edges generate L also have the same 
Euclidean volume. 

In the following theorem we establish a relation between the integer volume of a 
simplex S and the Euclidean volume of the corresponding parallelepiped P(S). 


Theorem 18.15. Let S be a k-dimensional integer simplex in IR" and let L(S) denote 
the integer lattice of the k-dimensional integer plane containing S. Then we have 


V(P(S)) 


IV(S) = aeKLS)). 


Proof. Consider a linear map T sending L to Z‘ Cc R*. We have 


V(P(S)) 


IV(T(S)) =IV(S),  V(T(P(S))) = det(L(S))’ 


and det(T(L(S))) = 1. 


Hence the statement of Theorem 18.15 follows directly from a similar statement for 
simplices of full dimension: 


which appears in Proposition 18.13. 


18.3.2 Integer volume of polyhedra 


Using the result of Theorem 18.15 we extend the definition of integer area to general 
polyhedra. 


Definition 18.16. Consider an integer polyhedron P of dimension n. The integer 
volume of P equals 


which we denote by 1V(P). 
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As in the two-dimensional case, we have the additivity property for lattice area. 


Proposition 18.17. The integer volume of polyhedra is additive, i.e., if an integer 
polyhedron P is a disjoint union of integer polyhedra P,,..., Px then 


k 


VV 1V(A). 


i=] 


2 
= 
i 


Proof. By definition the integer volumes of polyhedra of dimension n are propor- 
tional to their Euclidean volumes. Therefore, the additivity of Euclidean volume 
implies additivity of integer volumes. 


18.3.3, Decomposition into empty simplices 


Let us formulate two theorems on the existence of decompositions into simplices. 
The first holds in arbitrary dimensions, while the second is specific to the three- 
dimensional case. 


Theorem 18.18. For every convex integer polyhedron P (not contained in a hyper- 
plane) there exists a decomposition into integer empty simplices. 


Proof. Let us give a sketch of the proof. The proof is based on induction on the 
number of integer points in P € R”. 


Base of induction. If P has only n+1 integer points inside, then it is an empty sim- 
plex. 


Step of induction. Suppose now that the statement holds for every k < m. Let us 
prove it for m. Consider an integer polyhedron P. Suppose that it contain points 
P1,-++5Pm- Let fi,..., fs be all the (2n—2)-dimensional faces of P. We fix one integer 
point of P, say some pj, and consider all pyramids with vertex at this point and 
bases in faces f|,..., fs. These pyramids subdivide P into integer polyhedra. If P 
contains at least n+2 points, then it is possible to choose p; such that this subdivision 
contains at least two smaller polyhedra (each of them contains a smaller number of 
integer points). By induction all the smaller polyhedra are decomposable; hence P 
is decomposable itself. 


The decomposition of Theorem 18.18 usually contains triangles of distinct inte- 
ger types that are not necessarily integer congruent to the coordinate simplex. In the 
paper [95] of J.-M. Kantor and K.S. Sarkaria, a stronger result for three-dimensional 
polyhedra is given. 


Theorem 18.19. For every convex integer polyhedron P in R°? there exists a decom- 
position of 4P into integer tetrahedra congruent to the basis tetrahedron. 


(Recall that nP is the polyhedron whose vertices have all the coordinates n-times 
greater than the coordinates of P.) 
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Idea of the proof. By Theorem 18.18 it is enough to find a proof for the list of empty 
tetrahedra, which is known due to White’s theorem (see Corollary 19.3 below). It 
turns out that for every empty tetrahedron T € R°, the tetrahedron 47 admits a 
decomposition into integer tetrahedra congruent to the basis tetrahedron. We skip 
here the technical details related to explicit constructions of these decompositions. 


18.4 Lattice Plucker coordinates and calculation of integer 
volumes of simplices 


In order to calculate integer volumes of integer simplices we embed the space of all 
k-dimensional sublattices into the Grassmann algebra. Further, we express the inte- 
ger volume of a simplex in terms of certain Pliicker coordinates of this embedding. 


18.4.1 Grassmann algebra on RR" and k-forms 


The Grassmann algebra (or exterior algebra) of the vector space R” is the alge- 
bra over R with multiplication (usually denoted by A), generated by elements 1, 
€1,---,€n, where e1,...,@, is the basis of R”. The relations defining the algebra are 


ej \ej = —ej Nei; 
e;A1 =. lAe=e;3 
1A1l=1. 


This algebra is denoted by A(R”), with the operation ( called the wedge product. 

Let r be a positive integer. Denote by A*(IR”) the vector space generated by the 
elements e;, \--:Ae;,, where i,...,i¢ € {1,...,n}. In addition we put A°R” =R. 
The space A*(IR") is called the kth exterior power of R and its elements are called 
k-forms on R. 


Proposition 18.20. (i) The dimension of the kth exterior power of R" is as follows: 


‘ k(7pn\ — Gs ifk <n, 
oe ={ 6 ifk >n. 


(ii) Any element of the exterior algebra can be written as a sum of k-forms, so we 
have 


A(R") = A°(R")@ A(R") @ . --@A"(R"). 


Let us define a natural basis of the space A(R") for an arbitrary k <n. Let J and J 
be two strictly increasing sequences of k elements of the set {1,...,n}. We say that 
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I> J if there exists s < k such that i, = j, fort <s andi; > js. We call this ordering 
lexicographic. 
Definition 18.21. For a strictly increasing sequence J of k elements of the set 
{1,...,n} we define 

Q = ej, \-++ Nei, 
The basis of A*(R”) consisting of all k-forms of type @; whose indices are ordered 
lexicographically is called the lexicographic basis of A(R"). 


Example 18.22. The dimension of the space A7(IR*) is 6. The lexicographic basis 
of A?(R*) is as follows: 


(ei Aen, e\/\e3, e,/Ae4, e2/e3, e2/ea, e3/\e4). 


18.4.2 Pliicker coordinates 


The set of all k-dimensional subspaces of R” is called the linear Grassmannian, 
and denoted by Gr(k,R”). Denote by P(A*(IR”)) the projectivization of the rth ex- 
terior power A*(R"), i.e., we consider all forms up to a multiplication by a nonzero 
constant. 


Definition 18.23. The Pliicker embedding of a Grassmannian into projective space 
is the map 
@ : Gr(k,R") > P(A*(R’)), 
Span(v1,...,vg) > P({vpA...Avg}). 


It is clear that the form v; \---/A vz up to a nonzero multiplicative factor does not 


depend on the choice of basis in Span(v1,..., vx). 
Let e1,...,@,) be a basis of R” and let @),...,@y be the corresponding lexico- 

graphic basis in A*(R"). 
Definition 18.24. Consider a k-dimensional linear subspace L in R” with basis 
V1,--+,Vx. Let 

N 

VipAr AVE = y Aiaj. 

i=l 

The Pliicker coordinates of L are the projective coordinates (A; :...: Ay). 


To compute the Pliicker coordinates one should calculate the determinants of 
all (kxk) minors of the (kxn) matrix whose elements are coordinates of vectors 
V15-++5Vk- 


Example 18.25. Consider a two-dimensional linear subspace of R? generated by 
vectors v; = (1, 1,1) and v2 = (1,2,5). We have 
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vpAVv2 = (ey +e. +63) A (ey + 2e7 +5e3) 


11 11 11 
= det t e; \e2 +det ¢ e; \e3 +det € e2 /\e3. 


Hence the Pliicker coordinates of this linear subspace are 
11 11 11 
(aa('3) saee(!2) sae(38)) =<0:4:3), 


18.4.3 Oriented lattices in R" and their lattice Pliicker embedding 


In this subsection we define a modification of the Pliicker embedding that is useful 
in the study of lattices. We say that a lattice is a k-lattice if it has k generators and 
they span a k-dimensional subspace in R”. A k-lattice is said to be oriented if we fix 
an orientation of the span. Denote the space of all oriented k-lattices by Lat(k,R”), 
which we call the oriented lattice Grassmannian. 


Definition 18.26. The lattice Pliicker embedding of an oriented lattice Grassmann- 
ian in the space A*(IR”) is the map 


@ : Lat(k,R") + A*(R"), 
(V1,---5 VE) DS VIA Avg, 


where vj,...,Vx is a basis of the lattice (v),...,v,) defining the positive orientation. 


Remark 18.27. It is clear that the lattice Pliicker embedding is well defined (since 
different generators of the lattice define the same k-form up to sign, which is also 
preserved for generators of positive orientation). 


We define lattice Pliicker coordinates in a similar way. 


Definition 18.28. Consider a k-lattice L in R” with generators vj,...,v, defining 
the basis of positive orientation. Let 


N 
ViAr Ave = Y AiQ;. 
i=1 


The lattice Pliicker coordinates of L are the coordinates (A,,...,Aw). 


Remark. Notice that the lattice Pliicker coordinates of lattices are affine and the 
arbitrary Pliicker coordinates of subspaces are projective. 


Example 18.29. As in Example 18.25, we consider a two-dimensional linear sub- 
space of IR? generated by vectors vj = (1,1, 1) and v2 = (1,2,5). Its lattice Pliicker 
coordinates are (1,4,3). 
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18.4.4 Lattice Pliicker coordinates and integer volumes of simplices 


If L is a sublattice of the integer lattice Z”, then all its lattice Pliicker coordinates 
are integers. Let us now show how to write the integer volume of integer simplices 
in terms of lattice Pliicker coordinates. 


Theorem 18.30. Let 5,52...5%41 be a simplex in IR" and let (p,,...,pn) be the 
lattice Pliicker coordinates for the lattice generated by vectors sj8x4 fori=1,...,k. 
Then 


IV(s182...541) = ged(pi,..., pwn). 
Denote by v; the vector sjs,,; for i= 1,...,k. We begin with the following 

lemma. 
Lemma 18.31. The number gcd(pi,..., py) is a GL(n, Z)-invariant. 
Proof. Let 

N 

vi Ao Avg = ¥ pi; 
i=l 


(as usual, (@1,...,@y) is the lexicographic basis associated to a chosen basis 
(€1,---,@n)). Consider an arbitrary A € GL(n, Z). We have 


N N 
A(v1A-++ Avg) =A(¥ pias) = Y" piA(@;). 
i=l i=1 


Since all k-forms A(q@;) have integer coefficients and all the numbers p; are divis- 
ible by gcd(p1,...,pn), the greatest common divisor for the lattice Pliicker coor- 
dinates for the form A(v; A---A vg) (denote them by (q1,...,¢n)) is divisible by 
gced(p1,...,pn) as well. 

For the same reason, applying the inverse operator A~! to the k-form A(@;), we 
have that gcd(q1,...,qy) is divisible by gcd(p1,..., py). Hence 


gcd(qi,..-,9nv) =gced(pi,.--, Pn). 


Therefore, gcd(pi,...,pn) is a GL(n, Z)-invariant. 


Proof of Theorem 18.30. The statement holds for all lattices in the plane spanned by 
€1,---,ex. In this case the first lattice Pliicker coordinate is the Euclidean volume of 
the parallelogram P(s1,52,...,541), which coincides with the integer volume of the 
simplex 5152 ...5%41 (see Corollary 18.15). All the other lattice Pliicker coordinates 
are zero. 

An arbitrary k-dimensional plane can be taken to the coordinate k-plane by some 
GL(n,Z) transformation. By Lemma 18.31, the gcd(p1,..., pw) is a GL(n,Z) in- 
variant. So in this case it is also equal to the integer volume of the simplex. 


Example 18.32. Let us calculate the volume of the three-dimensional tetrahedron 
S = 51825354 of Example 18.3. Recall that 
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Fig. 18.1 The empty tetrahedra 7; and 7). 


sy = (2,3,0,1), s2=(1,4,2,4), 53 =(1,0,0,4), sa =(1,0,0,1). 


This tetrahedron defines a three-dimensional sublattice in Z+ generated by vectors 
V1 = S184, V2 = 5284, and v3 = 5354. So, we get 


vyyAv2AVv3 = (ey + 3e2) A (4e2 + 263 +3e4) A 3e4 
= 0- e;Ae2/Ae3 + 12- e; AerAe4 +6-e;Ae3Ae4 + 18 - e2Ae3/Aeq. 


The lattice Pliicker coordinates are (0,12,6,18), and their greater common divisor 
is 6. Hence the integer volume of S equals 6. 


18.5 Ehrhart polynomials as generalized Pick’s formula 


In this section we discuss one of the generalizations of Pick’s formula. We start 
with the following example, showing that there is no straightforward combinatoric 
generalization of Pick’s formula. 


Example 18.33. Consider the following two tetrahedra (see in Fig. 18.1): 


1 
T, = conv((0,0,0), (1,0,0), ( ) ) ),(1,1,2)). 
These tetrahedra are both empty, but they have distinct areas: 
V(M%)=1/6 and V(t) =1/3. 


The emptiness of the tetrahedra means that they have only four integer points as ver- 
tices, and hence they have the same number of integer points in all the corresponding 
faces. 


The following estimate relates the number of integer vertices in the polyhedron 
and its Euclidean volume. 
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Theorem 18.34. (S.V. Konyagin, K.A. Sevastyanov [126].) The number of ver- 
tices of an n-dimensional convex integer polyhedron P is bounded from above by 


c(n)V(P) it, where c(n) is a constant not depending on P. 


So the generalization should have a different nature. Let us say a few words on 
Ehrhart polynomials, which to some extent generalize Pick’s formula. 


Definition 18.35. Let P be a d-dimensional convex integer polytope in R”, and let 
t be a real variable. The function 


L(P,t) =#(tPNZ") 
is called the Ehrhart polynomial. 


Actually, the function L(P,t) is a polynomial only to a certain extent. 


Theorem 18.36. (E. Ehrhart [60].) Consider an arbitrary convex polyhedron P in 
R”. Let us restrict the variable t to integers. Then the Ehrhart polynomial of P is a 
polynomial of degree d. 


So for the integer variable ¢ there exist rational numbers ao,...,ag such that 
L(P,t) = agt? sat > +++++do. 


Let us discuss the lattice-geometric meaning of several coefficients. 


Proposition 18.37. Let P be an integer polyhedron. Then 


ag =1; 
1 
oo = IV(F 
ad-1 2. (d= (5 
IV(P) 
ames 


The sum in the formula for ag_ is taken over all faces of P of codimension 1. 


The other coefficients of the Ehrhart polynomial do not have a simple combina- 
torial interpretation now. We refer the interested reader to a recent book by M. Beck 
and S. Robins [22]. 


Remark 18.38. Let us study the Ehrhart polynomials for integer polygons in the 
plane. Consider a polygon P, let J be the number of integer points in the interior of 
P, and let E be the number of integer points in the union of edges of P. Then we 
have 

L(P,t) =A(P)-??+E/2-t4+1. 


By Definition 18.35 we have 
L(P,1) =#(PNZ’) =I+E. 


The last two equations (for t = 1) imply Pick’s formula: 
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A(P) =I+E/2-1. 


Example 18.39. Let us write the Ehrhart polynomials for the tetrahedra 7, and 7) 
in Example 18.33: 
— 134,24 U Zs 
L(T,t) = gx +a°+ ext+1; 
L(T,t) = gx +x? + 3x+ 1. 


18.6 Integer-regular polyhedra 


We conclude this chapter with the classification of integer-regular polyhedra in ar- 
bitrary dimensions. 


18.6.1 Definition of integer-regular polyhedra 


In this subsection we collect several general notions related to integer-regular poly- 
hedra. Recall that a convex n-dimensional polyhedron in R” is the convex hull of a 
finite set of points that are not contained in a hyperplane of R”. 


Definition 18.40. A point v of a polyhedron P is a vertex if for every pair of distinct 
to v points (v1,v2) of the polyhedron P the segment v,v2 does not contain v. 

A convex k-dimensional polyhedron Q is a k-dimensional face of a convex polyhe- 
dron P if the following three conditions hold: 

— all vertices of Q are vertices of P; 

— the affine plane spanned by Q does not have vertices of P other than vertices of 
Q; 

— the polyhedron P is contained in one of the (multidimensional) half-planes with 
respect to some affine hyperplane containing Q. 


Note that in our notation vertices are 0-dimensional faces (and edges are 1- 
dimensional faces). 


We continue with the definition of supporting planes that is coming from theory 
of convex polyhedra. 


Definition 18.41. A hyperplane 2 is supporting for a polyhedron P if P is contained 
in one of the half-planes with respect to z and the hyperplane z contains at least one 
point of P. 


We continue with the definition of flags in R”. 
Definition 18.42. A flag F = (%,,...,;%—1) in R" is a collection of a point 7, a 


line 7, and planes 7; for k = 2,...,n satisfying the condition 


Mo CM C++ CM. 
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Definition 18.43. A flag F = (%0,71,...,%—1) in R" is said to be a flag of a poly- 
hedron P if there exists a sequence of faces of P: 


Qo CQ) C++: C On, 


where the dimension of Q; is i fori =0,...,n, such that 
Q,;C 7; fori=0,...,n—1. 


A polyhedron is integer if all its vertices are integer points. 
Now we are in position to define integer regular polyhedra. 


Definition 18.44. An integer n-dimensional polyhedron in R” is said to be integer- 
regular if for any pair of its flags there exists an integer affine transformation of R” 
sending the first flag to the second one. 


Finally let us define elementary integer polyhedra. 


Definition 18.45. We say that a point v = (tx),...,tx,) is a t-dilate of the point 
wW = (x1,---,Xn), We write v = tw. 

An integer polyhedron P, with vertices (tv1,...,tvg) is said to be the t-dilate of the 
polyhedron P) with vertices (v1,...,vz), denote it by P; =tP). 

An integer polyhedron is elementary if it is not t-dilate of an integer polyhedron for 
every integer ft > 1. 


18.6.2 Schldafli symbols 


Consider an arbitrary regular n-dimensional Euclidean polyhedron P. Let F = 
(70,---;%—1) be one of its flags, with corresponding faces Q; C 7j. 


We consider Q; as a solid homogeneous body and denote by O; the center of 
mass of Q; (for i=0,...,1— 1). Let also O,, denote the center of mass for P itself. 
The n-dimensional tetrahedron O90, ...On—1 Oy is called the chamber of a regular 
polyhedron P corresponding to the given flag. 


Denote by 7; (for i = 0,...,n—1) the reflection about the (n—1)-dimensional 
plane that spans the center of masses O,,...,Oj41, Oi-1,..-,Oo0 (except the i-th 
one), see Fig. 18.2. These reflections are called basic. 


We have the following classical statement. 


Proposition 18.46. The group of Euclidean symmetries of an Euclidean regular 
polyhedron P is generated by the reflections ro,r\,...,1n—1- 

Fori=1,...,n—1 the angle between the invariant hyperplanes of the symmetries 
rj, and r; equals m/aj;, for some integer a; > 3. 
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Definition 18.47. The expression {a,...,d,—1} is said to be the Schldfli symbol 
for a (flag of) the regular polygon P. For simplicity we write a* for the substring 
a,a,...,a of the length s in the Schlafli symbol. 


Since all the flags of a regular polyhedron are congruent, the Schlafli symbol for 
different faces is the same, hence we can treat it as a the Schlafli symbol for the 
polyhedron itself. 


ia) a) TO 1/5 
ry TL 


Oo PF ax Oo Oo 


Fig. 18.2 Basic reflections and the Schlafli symbols for regular polygons. 


18.6.3 Euclidean regular polyhedra 


Let us list all regular convex Euclidean polyhedra (up to Euclidean transformation 
and homotheties). 


List of regular Euclidean polyhedra. 

Dimension 1: here any segment is regular, its Schléfli symbol is {}. 

Dimension 2: for every integer m > 3 we have a regular m-gon, its Schlafli sym- 
bol is {m}. 

Dimension 3: Starting from dimension three we have only finitely many regular 
polyhedra in each dimension. In dimension 3 we have 


¢ {3,3}: a regular tetrahedron; 

¢ {3,4}: a regular octahedron; 

¢ {4,3}: a regular cube; 

¢ {3,5}: a regular icosahedron; 

¢ {5,3}: a regular dodecahedron. 


Dimension 4: In this dimension we have some exotic regular polyhedra. The list 
here is as follows: 


¢ {3,3,3}: a regular tetrahedron; 
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¢ {4,3,3}: a regular cube; 

¢ {3,3,4}: a regular generalized octahedron (the other names for this polyhedron 
is cross polytope, or hyperoctahedron); 

¢ {3,4,3}: a regular 24-cell (or hyperdiamond, or icositetrachoron); 

¢ {3,3,5}: a regular 600-cell (or hypericosahedron, or hexacosichoron); 

¢ {5,3,3}: a regular 120-cell (or hyperdodecahedron, or hecatonicosachoron). 


Dimension n (n>4): Surprisingly, in higher dimensions the situation stabilises. 
There are only three regular Euclidean polyhedra in such cases: 


¢ {3"-!1: a regular tetrahedron; 
* {4,3"-71: a regular cube; 
¢ {3"-? 4) a regular generalized octahedron. 


Remark 18.48. In the one- and two-dimensional cases the statement is straightfor- 
ward. Three-dimensional regular polyhedra were already known in ancient times. 
The cases of dimensions higher than three were studied by Schlafli in [188]. 


18.6.4 Preliminary integer notation 


In this subsection we generate several lists of integer polyhedra that will be later 
used in Theorem 18.49. 


First of all, denote by e; the i-th unit coordinate vector for all admissible positive 
integers i. 


The segment. Denote by {1} the unit integer segment. 


Tetrahedra. For any n > 1 and any integer p we denote by (37 the n- 
dimensional tetrahedron with the vertices: 


Vo =O; V;=O+e;, fori=1,...,n—1, 
forming the unit coordinate tetrahedron in the hyperplane x, = 0 and the vertex 


n—1 


V, = (p-1) y ex + Pen. 
kt 


Cubes. Recall that in general any cube C is generated by one of its vertices P and 
an n-tuple of linearly independent vectors v;, namely 


C= {P+ y Aji 
i=1 


0< a <1,i=1,...,n} 


We denote by 
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° {4, 
by 

° {4, 
by 


sai in 
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gras for any n > 2: the integer cube with vertex at the origin and generated 
all basis vectors; 


cura for any n > 2: the integer cube with vertex at the origin and generated 
the first n—1 basis vectors and the vector 


ey tegt...+@n_1 +2en3 


areey for any n > 3: the integer cube with a vertex at the origin and gener- 


ated by the vectors: 


ey and e,;+2e; fori=2,...,n. 


Generalized octahedra. Denote by 


° {3 aye for any n > 2: the integer generalized octahedron with the vertices 


OF 


t e; fori = 1,...,n. This polyhedron is the convex hull of the unit coordinate 


tetrahedron together with its centrally symmetric (about the origin) copy; 


° {302 ANE for any positive n: the integer generalized octahedron whose vertices 


are 


the vertices of {3"-3,4}%: 


O+te; fori=1,...,n—-1, 


and extra two vertices 


O+ (e1 teat... +n] + 2en); 


@ {ans Aye for any positive n: the integer generalized octahedron with the vertices 


O, O-e|, O-e,-e; fori=2,...,n, and e; fori=2,...,n. 


Two particular 24-cells. Here we have the following two polyhedra: 


* {3, 


4, a\4: the 24-cell with 8 vertices of the form 


O+2(e9+¢3+e4), O+2(e; +e2 +24), 
O+2(e; + 3 +e4), O+2e4, 


and 16 vertices of the form 


43) 


O+ (eo +e3 +4) ae (e1 +e2 +e4) “Te (e1 + e3 +e) + é4. 


4, ate the 24-cell with 8 vertices of the form 


O+2(e; +e2+e3+e4), Ot2(e; -—e2.+03 +84), 
O+2(e;+e2-—e3+e4), Ot2(e; +e2+¢3—€@4), 
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and 16 vertices of the form 


O+ (e1 +é2+63 +e4) oe (e1 —e2+63 +e4) 
+(e] + e2 — 63 +e) sc (ey +e. +e3—e4). 


18.6.5 Integer-regular polyhedra in arbitrary dimensions 


The following theorem was proved in [100] (see also [149, 154] for further devel- 
opment). 


Theorem 18.49. Any elementary integer regular polyhedron is integer congruent to 
exactly one of the following polyhedra: 
Dimension 1: the unit segment {}7. 


Dimension 2: As we have seen in Proposition 2.26, we have 6 polygons in di- 
mension 2: 


* the triangles {3}% and {3}4; 
° the squares {43% and {4}4; 
* the octagons {6}% and {6}%. 


Dimensions 3, 5, 6, 7, 8, ...: these dimensions are rather regular. Here we have 
the following three classes of integer regular polygons: 


¢ the tetrahedra {3r ie where positive integers i are divisors of n+1; 
* the generalized octahedra {3"~*,4}%, for i = 1,2,3; 
* the cubes {4,3"-7}%, for i =1,2,3. 


Dimension 4: Finally, as a rule dimension 4 is exceptional. Here we have four 
combinatorially different integer-regular types of polyhedra: 


° the tetrahedra {3,3,3}% and {3,3,3}4; 

* the generalized octahedra {3,3,4}%, for i = 1,2,3; 
¢ the 24-cells {3,4,3}% and {3,4,3}4; 

the cubes {4,3,3}%, for i= 1,2,3. 


Fig. 18.3 illustrates the adjacency diagram of the elementary integer-regular 
polyhedra. The list of integer-regular polyhedra in dimension 3 consists of 9 items; 
they are shown in Fig. 18.4. 


18.7 Exercises 


Exercise 18.1. Calculate the integer distance and the integer angle between the 
following two planes in R*: 
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{3,3, 4}3 
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(34,318 
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{Ae 3\4 
{4, 3, 3}4 


Fig. 18.3. Adjacency of the elementary integer-regular polyhedra. 
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Exercise 18.2. Find some integer invariants of arrangements of three lines in R”. 


Exercise 18.3. Prove the statements of Proposition 18.20. 


Exercise 18.4. Prove that any two integer k-dimensional planes in R” are integer 
congruent. 


Exercise 18.5. Find the Pliicker coordinates of the sublattice generated by the fol- 
lowing vectors: 


vy =(1,2,3,0,0), v2 =(1,1,1,1,1), and v3 = (0,0,3,2,1). 


Exercise 18.6. Find and prove the expression for the leading coefficient of the 
Ehrhart polynomial. 


Exercise 18.7. Calculate the Ehrhart polynomial for the tetrahedron 
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T =conv((0,0,0), (2,0,0), (0,3,0), (1,2,3)). 


| ® 
Check for 
updates 


Chapter 19 
On Empty Simplices, Pyramids, Parallelepipeds 


Recall that an integer polyhedron is called empty if it does not contain integer points 
other than its vertices. In this chapter we give the classification of empty tetrahedra 
and the classification of pyramids whose integer points are contained in the base 
of pyramids in R°. Later in the book we essentially use the classification of the 
mentioned pyramids for studying faces of multidimensional continued fractions. 
In particular, the describing of such pyramids simplifies the deductive algorithm 
of Chapter 24 in the three-dimensional case. We continue with two open problems 
related to empty objects in lattices. The first one is a problem of description of empty 
simplices in dimensions greater than 3. The second is the lonely runner conjecture. 
We conclude this chapter with a proof of a theorem on the classification of empty 
tetrahedra. 


19.1 Classification of empty integer tetrahedra 


Recall that in the two-dimensional case there is a unique class of empty triangles 
(see Proposition 2.15 and Remark 2.16). In this section we examine the situation in 
the three-dimensional case. We start with White’s theorem on geometric properties 
of empty tetrahedra. 


Let ABCD be a tetrahedron with enumerated vertices. Denote by P(ABCD) the 
following parallelepiped: 


{A+@AB+ BAC+ YAD|0< a@<1,0<B<1,0<y< 1}. 


Theorem 19.1. (White’s theorem [222].) Let ADBA’ be an empty tetrahedron. 
Then all integer points in P(ADBA') except the vertices are contained in one of 
the planes passing through centrally symmetric edges distinct from the edges of the 
tetrahedron. 
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C’ 


B 


Fig. 19.1 If ADBA’ is empty, then all integer points are in one of the dark gray three planes. We 
give an example with four points inside. 


Remark 19.2. For every parallelepiped there exist exactly three planes as in the 
theorem. See an example with four inner integer points in Fig. 19.1. 

Theorem 19.1 leads to a complete classification of empty integer tetrahedra with 
one marked vertex. 


Corollary 19.3. (i) Classification of marked empty tetrahedra. An integer empty 
tetrahedron with a marked vertex is integer congruent to exactly one of the following 
marked tetrahedra: 

— a tetrahedron with vertices (0,0,0), (1,0,0), (1,0, 1), and (1,1,0); 

_ Te, of the list “T-W,” where r>2, 0<§<r/2, and gcd(€,r)=1 (see Fig. 19.2). 

The point (0,0,0) is a marked vertex for all the tetrahedra in the list. 
(ii) Classification of empty tetrahedra. Any empty tetrahedron is integer congru- 
ent to an empty marked tetrahedron. The tetrahedra Ti and Pies (without marked 
vertex) are integer congruent if and only if r; =r and for r; > 1, at least one of the 
following four equations holds: 


(6) = (tv mod re 


Remark 19.4. Classifications of empty tetrahedra and empty marked tetrahedra co- 
incide only for r = 1,2,3,4,5,6,8,10,12,24. (This is the case when the square of 
every element of the multiplicative group (Z/mZ)* equals +1. Hence all the sum- 
mands in the multiplicative group are either Z/4Z or Z/2Z. There are 15 such 
groups. Only nine of them satisfy the condition. In addition we have the case r = 1.) 


We give all the proofs for the statements of this section in Section 19.4. 


19.2 Classification of completely empty lattice pyramids 


We say that a pyramid is marked if the vertex of the pyramid is specified. (So every 
tetrahedron corresponds to four marked pyramids, while all the other pyramids are 
in one-to-one correspondence with marked pyramids.) A pyramid in R? is called 
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The list |/Parameters |Coordinates of |Integer-affine type Id 
“T-W” the face of the face 
re, a>1, r>2, (€,r-1,—r), (0, 1) rt 

0<€<r/2, |(a+é,r—1,—-r), ~ 
ged(§,r)=1 |(6,7,—r) 
Up b>2 (1,b—1,2), 2 
(2,—1,2), 
(0,=1 »2) 
V (—2,1,2), 2 
(—1,-1,2), 
(1,2,2) 
Ww (0, 2,3), 3 
(1, 1,3), 
3 3) 


Fig. 19.2 The T-W list of empty pyramids. The origin (0,0,0) is the marked vertex for all the 
pyramids. 


integer if its vertex is an integer point and its base is an integer polygon. An integer 
pyramid is called completely empty if every integer point inside the pyramid is either 
contained in the base or coincides with the vertex of the pyramid. 

An integer pyramid is called one-story (multistory) if the integer distance from 
the vertex to the base is 1 (> 1). One-story integer empty pyramids can have an arbi- 
trary polygon as a base, since there is no geometric obstacle to constructing arbitrary 
pyramids. The multistory case is more interesting, as described in the following the- 
orem. 


Theorem 19.5. ((98]) Every multistory completely empty convex three-dimensional 
marked pyramid is either triangular or quadrangular. Every triangular multistory 
completely empty pyramid is integer congruent to exactly one of the marked pyra- 
mids from the list “T-W” (see Fig. 19.2). Every quadrangular multistory completely 
empty pyramid is integer congruent to exactly one of the following pyramids: 


Coords. |Coordinates of |Integer-affine type Id 
of the the base of the base 
vertex 
Map (0,0,0) |(—1,0,2), 2 
b>a>1 (—a—1,1,2), 
(—1,2,2), 
(b—1,1,2) 
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We skip the complete technical proof Theorem 19.5 (for a complete proof we 
refer to [98]). 


19.3 Two open problems related to the notion of emptiness 


Let us briefly mention two open problems related to arrangements of integer points 
in the interior of base parallelepipeds of integer sublattices. 


19.3.1 Problem on empty simplices 


In this subsection we say a few words about the following problem. 
Problem 12. Classify empty integer simplices in dimension n. 


From Proposition 2.15 it follows that all empty triangles are integer congru- 
ent to the coordinate triangle (n = 2). In Corollary 19.3 all distinct empty three- 
dimensional tetrahedra are listed (n = 3). For the rest of the cases (n > 4), the prob- 
lem is open. In the case n = 4, we would like to mention one interesting recent 
result. To formulate it we introduce a notion of the width of integer polyhedra. 


Definition 19.6. The width of an integer polyhedron P in R” is the minimal integer 
distance between two parallel integer hyperplanes containing P in the middle. 


Let us illustrate this definition with several examples. 


Example 19.7. The widths of the following triangles, 


4g 


are respectively 1, 2, and 3. These triangles have minimal possible integer area 
among the triangles of the given width: the areas are 1, 3, and 7 respectively. 


Example 19.8. The width of every empty three-dimensional tetrahedron is 1. 


The last example means that every empty tetrahedron is contained between two 
neighboring parallel integer planes (see Fig. 19.3). 
Almost all four-dimensional empty integer simplices have the following property. 


Theorem 19.9. (M. Barile et al. 2011 [20].) Up to possibly a finite number of 
exceptions, every empty simplex in dimension 4 has width | or 2. 


To conclude this subsection we observe once more that the complete classifi- 
cation of empty simplices in R* is still not known, while the situation in higher 
dimensions is entirely open. 
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L 7 


Fig. 19.3. All empty three-dimensional tetrahedra are of width 1. 


19.3.2 Lonely runner conjecture 


In this subsection we discuss the lonely runner conjecture, which is also known as 
view-obstruction problem for n-dimensional cubes. We start with a general formu- 
lation of the conjecture. 


Conjecture 13. (Lonely runner conjecture.) Suppose k runners having distinct 
constant speeds start at a common point and run laps on a circular track with cir- 
cumference 1. Then for any given runner, there is a time at which that runner is at 
arc distance at least 1/k away from every other runner. 


This conjecture was introduced by J.M. Wills [223] in 1967. Later it was consid- 
ered by T.W. Cusick (in [43], [45], and [46]) as the geometric view-obstacle prob- 
lem. The cases k = 2,3 are easy exercises. The conjecture holds for k = 4 (U. Betke 
and J.M. Wills [23], T.W. Cusick [45]), & = 5 (T.W. Cusick and C. Pomerance [48], 
see also in [24]), k = 6 (T. Bohman et al. [25]; see also in [24]), andk =7 (J. Barajas 
and O. Serra [17] and [18]). For k > 8 it is not known whether this conjecture is true. 

Let us describe the question in geometric terms. Without loss of generality we 
study the conjecture only for the first runner. We say that the starting point is the 
zero point of a circle. It is usual to put the speed of the first runner equal to zero. We 
associate each runner that travels with nonzero speed with a point on a circle. The 
configuration space of all (k—1)-tuples on the circle is the (k—1)-dimensional torus 
T'—! (which is the direct product of k—1 copies of circles $1! = R/Z). We mark 
the point (0,...,0) on this torus as the first runner (who is actually not running but 
standing). While the time goes from zero to infinity, the configuration of (k—1)- 
runners travels linearly in the torus T*~!. The question is whether this trajectory is 
always far from the middle point of the torus (1/2,1/2,...,1/2). 

To simplify the picture it is good to consider the universal covering space of 
T*-!, which is Ré~!. Here the standing runner is associated with an integer lattice 
Z*—!. Now we reformulate the lonely runner conjecture as a view-obstacle problem. 


Lonely runner conjecture as a view-obstacle problem. A hunter is standing in 
the middle (origin) of an n-dimensional garden. At each vertex of the shifted lattice 
Z-!4(1/2,1/2,...,1/2) a tree grows. All the trees are hypercubes of size 1 — 2/k 
centered at the vertex of the shifted lattice and with sides parallel to the coordinate 
axes. Suppose the hunter is not willing to shoot in the directions having at least one 
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zero coordinate. Then he will shoot into some tree. It is supposed that the bullet has 
Zero Size. 


Finally, we give a discrete lattice version of the lonely runner conjecture. 


Discrete version of lonely runner conjecture. Does there exist a full-rank sublat- 
tice of Z‘! (suppose it is generated by v1,...,vj%_1) such that the hypercube 


gai Peak 1} 


does not contain integer points. 


Remark 19.10. Let us say a few words about the inverse question of meeting of all 
the runners. Suppose & runners having distinct constant speeds start at a common 
point and run laps on a circular track. Then for every € > 0 and T > 0, there is a 
time t > T at which all the runners are at arc distance at most € away from each 
other. We leave this as an exercise to the reader. 


19.4 Proof of White’s theorem and the empty tetrahedra 
classification theorems 


We start with a proof of White’s theorem. Further, we deduce from it the classi- 
fication of empty tetrahedra. Before proving White’s theorem, we introduce some 
necessary definitions and prove several preliminary statements. 


19.4.1 IDC-system 


We begin with some necessary definitions. 


Definition 19.11. Let A, B, C, and D be integer points that span the entire space 
IR3 (the collection of points is ordered). Let us construct a system of coordinates 
related to these points. Let b, c, and d be the integer distances from the points B, 
C, and D to the planes of the faces ACD, ABD, and ACD, respectively. Consider the 
point A as the origin. Set the coordinates of B, C, and D equal to (b,0,0), (0,c,0), 
and (0,0,d) respectively. The coordinates of the other points in R? are defined by 
linearity. We say that this system of coordinates is an integer-distance coordinate 
system with respect to ABCD or (IDC-system for short). 


We call the points with integer coordinates in an IDC-system the IDC-nodes. We 
say that an IDC-node is integer if it is an integer point of R?. Notice that every 
integer point in the old system is an IDC-node. The converse is true if and only if 
the vectors AB, AC, and AD generate the integer lattice. The coordinates of integer 
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IDC-nodes coincide with the integer distances to the coordinate planes in the IDC- 
system. 


19.4.2 A lemma on sections of an integer parallelepiped 


If a and b are two integer vectors, denote by L,,» the integer sublattice generated by 
a and b. Consider the set of all cosets in Z? /Lap- Every left coset of this quotient 
group is contained in a plane parallel to the plane spanned by the vectors a and b. 
In addition, every such plane contains only finitely many left cosets, their number 
being equal to the index of the sublattice L, , in the integer sublattice in the subspace 
Span(a,b). 


Lemma 19.12. Consider an integer parallelepiped P = ABCDA'B'C'D' and a plane 
m parallel to ABCD. Let 1 intersect the parallelepiped (by some parallelogram). 
Then the following two statements hold. 


(i) The section of the parallelepiped P by the plane x contains representatives of all 
left cosets of Z? /Lq» that are in T. 


(ii) Every left coset of Z? /Lqy contained in 1 intersects the parallelepiped P, and 
the intersection is described by one of the following cases: 
(a) a point inside the parallelogram 1 P; 
(b) two points on opposite edges of %M P; 
(c) four points coincide with the vertices of TMP. 


We leave the proof of this lemma as a simple exercise for the reader. 


19.4.3 A corollary on integer distances between the vertices and the 
opposite faces of a tetrahedron with empty faces 


Corollary 19.13. All integer distances from the vertices of an integer (three-dimen- 
sional) tetrahedron with empty faces to the opposite faces are equal. 


Proof. Consider an integer tetrahedron OABC with empty faces. Suppose that 
ld(A, OBC) =n. 


Let us show that Id(B, OAC) = 1d(C, OAB) =n. 

Consider the parallelepiped P(OABC). Since the triangles OBC, OAB, and OAC 
are empty, the corresponding faces of P(OABC) are empty as well. Hence all faces 
of the parallelepiped are empty. Consider all the integer planes parallel to OBC 
that divide the parallelepiped into two nonempty parts. Since Id(A, OBC) = n, the 
number of these planes equals n — 1. 
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By Lemma 19.12 every such integer plane contains exactly one left coset of the 
lattice Z3 / Los,oc: Since the faces of the parallelepiped are empty, the intersection of 
this class with a parallelepiped is one point in the interior. Hence the parallelepiped 
P(OABC) contains exactly n — | integer points. 

Suppose that Id(B, OAC) = m, and ld(B, OAC) = k. Then for similar reasons the 
parallelepiped contains m— 1 and k — | interior integer points, respectively. There- 
fore, we have k=m=n. 


19.4.4 Lemma on one integer node 


For the proof of Theorem 19.1 we need the following lemma. Let Id(B,ACD) = r 
for some r > 1. Consider the IDC-system with respect to ADBA’ (with integer- 
distance coordinates (x,y,z)). Denote by B’, C, C’, and D’ the vertices with coor- 
dinates (0,7,r), (7,7,0), (,7,r), and (7,0,7r), respectively. 


Lemma 19.14. There is a unique integer node in the interior of the intersection of 
the plane x +y+z=r-+1 and the parallelepiped (here we restrict ourselves to the 
caser> 1). 


Proof. Since 1d(A,A‘B’CD) = r and the parallelogram A’B’CD is contained in the 
plane x+y+z=r, for every integer n, the plane x+ y+ z =n is integer. So the plane 
x+y+z=r-+1 is also integer. 

Consider a parallelogram obtained from A’B’CD by the shifting on the vector 
(0,0,1) in the IDC-system. This parallelogram is contained in the plane x + y+ 
z=r-+l. The edges of the shifted parallelogram are on the planes of faces of the 
parallelepiped and they do not contain vertices. Hence there exists a unique point 
in the interior of the shifted parallelogram, denoted by K (x0, yo,zo). Therefore, we 
have at most one point satisfying the conditions of the lemma. 

Let us show that K(x0, yo, Zo) is inside the parallelepiped. The shifted parallelo- 
gram is defined by the following four inequalities and one equation: 


O<x<y, O<y<vr, and x+y+z=r-+l. 


Since the point (0,0,r+ 1) is not integer (r+ 1 is not divisible by r), we have 
Zo <r. 

Suppose that zo < 0. In this case the vector KC is integer; hence the point 
K' =A+KC is an integer node (see Fig. 19.4). Notice that the point K’ is in the 
tetrahedron ADBA’ and it is distinct from the vertices of the tetrahedron. So ADBA’ 
is not empty, contradicting the conditions of the lemma. Therefore, we obtain zp > 0. 

Hence we get0 <x <r,0< yo <r, and 0 < zo <r. Since the edges of the shifted 
parallelogram do not pass through the vertices of the parallelepiped, the point K is 
in the interior of the parallelepiped. 
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B' Cc 


Fig. 19.4 The points K and K’ are the integer nodes. 


19.4.5 Proof of White’s theorem 


We prove the statement of the theorem by induction on the number of points inside 
the parallelepiped (or equivalently on the value of ld(A’, ABD)). 


Base of induction. Suppose that 1d(A’,ABD) = 2. Then the unique integer node in- 
side the parallelepiped is the center of symmetry of the parallelepiped. It is in the 
intersection of all planes passing through opposite edges. 


Step of induction. Suppose that the statement of the theorem holds for all empty 
tetrahedra with integer distance from the vertices to the opposite edges less than r. 
Let us prove the theorem for an arbitrary empty tetrahedron A’ABD with 


1d(A’, ABD) =r 


By Lemma 19.14 there exists a unique integer node in the planex+y+z=r+1 
inside the parallelepiped P(A’ABD), denoted by A’’(xo,yo,Z0) in the IDC-system 
with respect to the tetrahedron ABDA’. Without loss of generality, we suppose 
that z) > xo and zo > yo. The tetrahedron ADBA” is an empty integer tetrahe- 
dron, since all the nodes inside ADBA” except A” are inside ADCA’ as well. We 
get Id(A”,ADB) <r, and hence by induction we have that all integer nodes of 
P(ADBA") are either in the plane A” CD, in the plane A” BC, or in the plane B’” BD 
(where B” = A” + AB). Consider the intersections of these planes with the plane 
z = 1. One of these intersections should contain an integer node inside the paral- 
lelepiped P(ADBA"). 

Case 1. The integer nodes are in the plane A’ CD. The plane ACD is defined by 
the equation 
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r—-Xo 
X+ 


Z=nr. 
Zo 

The intersection of the plane A”CD with the plane z = 1 is the line (x,t, 1), where 

t is a linear parameter of the line and x; is a constant equal to r— a This plane 

contains integer nodes only if x; is integer. Let us estimate it (assuming that yo < Zo). 


On the one hand, 


E ey en ee 1 
or r-xo_ (r+1—zo YO) cy, 1-20 
20 i £0 x0 
2r-2+—>r-2. 
Z0 


On the other hand, x; <r. Since x; is an integer and r—2 < x; <r, we have x; = 
r—1. Therefore, there exist integer nodes with coordinates (r — 1,t,1), and hence 
by Lemma 19.12 one of them (with coordinates (r — 1,fo,1), for some integer fo) 
is contained in the parallelepiped P(ADBA’). So the rest of the integer nodes of the 
parallelepiped P(ADBA’) have the coordinates (r—k,(k-t9 modr),k), and hence 
they are all in the parallelogram A’B’CD. 

Case 2. The integer nodes are in the plane A’’BC. The plane A” BC is defined by 
the equation 


Seas 


Z=nr. 


The intersection of the plane A” BC with the plane z = 1 is the line (t,y,,1), where 
t is a linear parameter of the line and y, is a constant equal to r— a This plane 
contains integer nodes only if x; is integer. Let us estimate it (assuming that x9 < Zo). 
On the one hand, 


= EA Say 1 
r—yo asl, r—(r+ Zo —X0) os. 1 x0 
Z0 Z0 40) 20 
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On the other hand, y; <r. Since y; andr—2 < yj <r, we have y; =r—1. Therefore, 
there exist integer nodes with coordinates (t,r— 1,1), and hence by Lemma 19.12 
one of them (with coordinates (t9,r— 1,1), for some integer to) is contained in 
the parallelepiped P(ADBA’). So the rest of the integer nodes of the parallelepiped 
P(ADBA’) have the coordinates ((k-f9 modr),r—k,k), and hence they are all in 
the parallelogram A’B’CD. 


The integer nodes are not in the plane B’BD. Let us show that the case of the 
plane B” BD is an empty case. This plane is defined by the equation 


r—yo — Xo 
x+y + aM =r 
Z0 
The intersection of the plane BBD with the plane z = 1 is the line (44, 5*,1), 
where f¢ is a linear parameter for the line and a is a constant equivalent to r— 
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Such a line contains integer nodes only if a is integer. On the one hand. 


CMO se zo—1 
20 Z0 


>r-l. 


a=r 


On the other hand, we have a < r. Therefore, there are no integer nodes in the 
intersection of the parallelepiped P(ADBA’) (and of the parallelepiped P(ADBA") 
as well) with the plane z = 1, which is impossible. 


Therefore, the first two cases enumerate all realizable positions of integer nodes. 
In both cases the integer nodes are in one of the planes mentioned in the formulation 
of the theorem. Hence, the statement holds for an arbitrary pyramid ABCDA’ with 
1d(A’, ABD) = r. The proof of theorem is completed by induction. 


Remark 19.15. Notice that the third case in the proof is empty, since we assume that 
the third coordinate of A” is not less than the first and the second coordinates. 


19.4.6 Deduction of Corollary 19.3 from White’s theorem 


Proof of Corollary 19.3(i). We prove three statements: on completeness of the list, 
on emptiness of the tetrahedra in the list, and on pairwise noncongruence of the 
listed tetrahedra. 


Completeness of the list. Considering an integer empty tetrahedron A’ABD with a 
marked vertex A’, let 1d(A’,ABD) = r. If r = 1, then we get a tetrahedron integer 
congruent to (0,0,0), (1,0,0), (1,0, 1), and (1, 1,0). 

Suppose that r > 1. Then consider a point K a unit integer distance from the plane 
ABD lying in the parallelepiped P(ABDA’). By White’s theorem, K is in one of the 
three diagonal planes. Without loss of generality, we assume that K is in A’B’CD. So 
K has coordinates (€,r— 1,1) in the IDC-system related to ABDA’. Let us rewrite 
the coordinates of the point in the basis AB, AC, and AK (notice that it is a basis of 
the integer lattice). We have 


A=(0,0,0), B=(1,0,0), C=(0,1,0), and A’=(-€,1—r,7). 


Hence ABDA’ is integer congruent to Te 

The parameter € is relatively prime to r, since there would otherwise exist integer 
points on the edges distinct from the vertices. The tetrahedra ie and je are 
integer congruent. Therefore, we restrict the consideration of the tetrahedra to the 
case 0<€<r/2. 
Emptiness of the tetrahedra. All the tetrahedra are empty, since all integer points 
in P(ABDA’) are contained in one of the diagonal planes that does not intersect the 
tetrahedron in the interior of the parallelepiped. 
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Any two marked tetrahedra in the list are not integer congruent. Let us in- 
troduce the following integer invariant to distinguish the tetrahedra. Consider an 
arbitrary marked tetrahedron ABDA’ with vertex A’ and the related three-sided angle 
with vertex at A’ and a section ABD. By Lemma 19.14 this plane contains a unique 
integer point (denoted by K) on the plane parallel to ABD at integer distance r+ 1 
from A’. Integer distances to the faces of the three-sided angle are 1, €, and r— & for 
some €. The unordered collection [1,§,r —6&] is an integer invariant of the marked 
tetrahedron A’ABD. This invariant (together with Id(A’, ABD) = r) distinguishes all 
distinct tetrahedra in the list of the corollary. 


Proof of Corollary 19.3(ii). We prove the same three statements as in the proof of 
Corollary 19.3(i). 


Completeness of the list. Emptiness of the tetrahedra. By Corollary 19.3(i) every 
empty tetrahedron is integer congruent to at least one of the marked tetrahedra in 
the list of Corollary 19.3(i). We have already shown that all the tetrahedra in this 
list are empty. 


Integer congruence of tetrahedra in the list. The affine group of symmetries of 
tetrahedra S4 includes the affine group of symmetries of marked tetrahedra $3. For 
each of the four angles of a tetrahedron we construct the integer point uniquely 
defined by the conditions of Lemma 19.14. Direct calculation shows that the inte- 
ger 4-tuples of distances from these points to four planes of the faces of an empty 
tetrahedron are as follows 


(1,1,6,r-—6), (1,1,6,r-—6), (v,r—v,1,1), and (v,r—v,1,1), 


where r is the volume of the tetrahedron and the pair (€,v) satisfies €- v = (1 
mod r). So this unordered 4-tuple of unordered 4-tuples is an invariant of empty 


tetrahedra. The tetrahedra Re Sie us, and 7’~” in the list of Corollary 19.3(i) 
are integer congruent; all the other pairs of empty tetrahedra in the list have distinct 


4-tuples of unordered 4-tuples of distances, and hence they are not congruent. 


19.5 Exercises 


Exercise 19.1. Find triangles of width 4 with the smallest possible integer area. 
Exercise 19.2. Prove that the width of all empty 3-dimensional tetrahedra is 1. 
Exercise 19.3. Find proofs for the lonely runner conjecture for 2 and 3 runners. 
Exercise 19.4. Prove the statement of Remark 19.10. 

Exercise 19.5. Write the Ehrhart polynomial for all empty tetrahedra. 


Exercise 19.6. Prove the equivalence of the lonely runner conjectures in the classi- 
cal, view-obstacle, and discrete forms. 


| ® 
Check for 
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Chapter 20 
Multidimensional Continued Fractions in the 
Sense of Klein 


In the first part of this book we studied continued fractions from the geometric point 
of view of sails and their integer invariants. In this chapter we present a natural 
generalization of sails to the multidimensional case by F. Klein. 


20.1 Background 


In 1839, C. Hermite [84] posed the problem of generalizing ordinary continued 
fractions to the higher-dimensional case. Since then, there have been many differ- 
ent definitions generalizing different properties of ordinary continued fractions. In 
this book we focus on the geometric generalization proposed by F. Klein in [119] 
and [120]. Multidimensional continued fractions in the sense of Klein have many 
relations with other branches of mathematics. For example, in [215], H. Tsuchi- 
hashi described the relationship between periodic multidimensional continued frac- 
tions and multidimensional cusp singularities. M.L. Kontsevich and Yu.M. Suhov 
studied the statistical properties of random multidimensional continued fractions 
in [124], and later they were studied by the author in [101]. O.N. German [74] and J.- 
O. Moussafir [155] discussed the connection between the sails of multidimensional 
continued fractions and Hilbert bases. The relations to approximation theory of max- 
imal commutative subgroups is discussed by A. Vershik and the author in [109]. 
The combinatorial topological generalization of Lagrange’s theorem was obtained 
by E.L Korkina in [127] and its algebraic generalization by G. Lachaud [134]. The 
book [13] of V.I Arnold is a good survey of geometric problems and theorems asso- 
ciated with one-dimensional and multidimensional continued fractions in the sense 
of Klein (see also his articles [9], [11], and [12]). 


Originally, F. Klein introduced his multidimensional continued fractions in the 
context of studying decomposable forms with integer coefficients, which we dis- 
cuss later in more general settings in Chapter 25. Periodicity of multidimensional 
sails plays an important role in the study of algebraic irrationalities, since the pe- 
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riodic structure is defined by the induced action of the group of units in the orders 
of algebraic fields. Later, we give a multidimensional analogue of Lagrange’s theo- 
rem, in Chapter 22. Periods of sails are complete invariants of conjugacy classes of 
SL(n,Z) matrices in multidimensional Gauss’s reduction theory (see Chapter 25). 
In Chapter 23 we present statistical properties of multidimensional sails. Most of the 
interesting questions related to multidimensional sails appear already in the three- 
dimensional case, so within this book we pay the most attention to sails in R*. We 
observe that some three-dimensional theorems have quite simple generalizations to 
the case of higher dimensions, while some others become hard open problems even 
in the four-dimensional case. 


We say a few words about the other generalizations of multidimensional contin- 
ued fractions in Chapter 27. 


20.2 Some notation and definitions 


20.2.1 A-hulls and their boundaries 


We begin with several preliminary definitions. 

Consider a set S C R”. We denote by S the closure of S. A point p € S is called 
interior if there exists a ball in R” with center at p that is completely contained in 
S. The union of all interior points of S is called the interior of S. The boundary of 
S is the set of points in § not belonging to the interior of S. We denote it by 0S. An 
element of the boundary of S is called a boundary point of S. 

A simplicial cone in R" is the convex hull of k rays with the same vertex and 
linearly independent directions. (It immediately follows that k < n.) The vertex of 
a simplicial cone is the common vertex of the rays defining the cone. The rays are 
called the edges of the cone. The convex hull of any subset of rays defining the cone 
C is called a face of C. A simplicial cone is called integer if its vertex is integer. 
A simplicial cone is called rational if its vertex is integer and all its edges contain 
integer points distinct from the vertex. 


Definition 20.1. Consider an integer simplicial cone C of dimension d with center 
at the origin in R”. An A-hull of the cone C is the convex hull of all integer points 
lying in the closure C except the origin. We denote this set by A-hull(C). 


Now we formulate the classical concept of the sail, which is the cornerstone of 
the theory of multidimensional continued fractions in the sense of Klein. 


Definition 20.2. The boundary of the set A-hull(C) is called the sail of this cone. 
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20.2.2 Definition of multidimensional continued fraction in the 
sense of Klein 


Let us give the classical definition of multidimensional continued fractions. Con- 
sider a set of n+1 hyperplanes of R”*! passing through the origin in general position 
(i.e., whose union is homeomorphic of the union of all coordinate hyperplanes). The 
complement to the union of these hyperplanes consists of N = 2”*! open simplicial 
cones C},...,Cy. 


Definition 20.3. The set of all sails for all the cones C),...,Cy is called the n- 
dimensional continued fraction associated to the given n+1 hyperplanes in R”’*! 
passing through O in general position. 


In this chapter we investigate combinatorial and topological properties of sails. 
First we study the case of finite multidimensional continued fractions. Further we 
prove a generalized Kronecker’s approximation theorem (which is actually inter- 
esting by itself) and deduce from it general theorems on the polyhedral structure of 
sails. Finally, we discuss the situation around two-dimensional faces of sails in more 
detail. 


20.2.3 Face structure of sails 


Let us first recall several preliminary general definitions. 


Definition 20.4. Let S be a convex set in R”. A hyperplane 7 is said to be supporting 
if the following two conditions hold: 
(i) the set S is entirely contained in one of the two (multidimensional) half-planes 
defined by 7; 
(ii) TAS AO. 

To define faces of sails we use the following rather abstract definition. 


Definition 20.5. Assume that 7 is a supporting hyperplane for a convex set S and 
that S is not a subset of 2. Then we say that the set Fy = 7S is a face of S. 

The dimension of Fz is the dimension of a face. If Fz is a point or a segment (in- 
cluding rays), we say that the face is either a vertex or an edge. 


Remark 20.6. Notice that it is not necessary that the faces of A-hulls coincide with 
the faces of their closures. One of the illustrations of this is the cone of Exam- 
ple 20.20 below, shown in Fig. 20.4. The shaded area is not a face of A-hull(C), but 
it is a face of its closure. The face of the A-hull(C) is the dashed ray. 


Let us now introduce the notion of faces for sails. 


Definition 20.7. Consider a simplicial n-dimensional cone C. A subset F in the sail 
of C is called a face of the sail if F is a face of A-hull(C). 
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Fig. 20.1 A sail of a finite continued fraction. 


In the most interesting cases the set A-hull(C) is closed (see Theorem 20.24). So 
the following rather tautological proposition is of interest in studying the faces of 
sails. 


Proposition 20.8. Consider a simplicial n-dimensional cone C in R". Every face of 
the set A-hull(C) is a convex hull of a subset of Z”. 


20.3 Finite continued fractions 


Consider a rational simplicial cone C in R”*! with vertex at the origin. The sail of 
C has finitely many compact faces of all dimensions. In fact, all compact faces are 
contained in the pyramid OA;A2...A,+1, where A; is the first integer point on the 
ith edge distinct from O. We call sails of rational cones finite. If a multidimensional 
continued fraction has a finite sail, then all the other sails of this continued fraction 
are also finite; we call this continued fraction finite. 


Example 20.9. In Fig. 20.1 we give an example of a sail for a simplicial cone 
defined by the three vectors (3,2,3), (1,—4,3), and (—2, 1,3). 


We recall that the classification of two-dimensional cones (i.e., planar angles) and 
therefore the corresponding geometric one-dimensional sails is provided by LLS se- 
quences for both the finite and infinite cases in Chapter 4. The problem of describing 
multidimensional (in particular two-dimensional) finite sails is relatively new and 
currently has no consistent answer. Let us study the two-dimensional sails that have 
a unique compact face. It is clear that this face is triangular, since the cone has three 
edges. 


Theorem 20.10. Let the sail for a simplicial cone C in R> have a unique compact 
face. Then this face falls into one of the following two cases. 
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(i). If the integer distance from the origin to the triangular compact face is one, 
then every integer triangle is realizable as this face. Two sails whose unique compact 
faces are at integer distance one are integer congruent if and only if their triangular 
faces are integer congruent. 

(ii). Suppose that the face is at an integer distance to the origin greater than 1. 
Then the integer congruence classes of corresponding sails are in one-to-one cor- 
respondence with the list “T-W” of completely empty pyramids shown in Fig. 19.2 
(here each pyramid is associated to a simplicial cone centered at a marked vertex of 
the pyramid and having edges containing edges of the pyramid). The compact faces 
of the sails are exactly the bases of such pyramids. 


Proof. If the integer distance from the face to the vertex is one, then the statement 
is straightforward. If the distance is greater than one, then the statement follows 
directly from Theorem 19.5 on the classification of integer multistory completely 
empty marked pyramids. 


Later we formulate a more general theorem for two-dimensional faces at distance 
greater than | (see Corollary 20.36). 

For sails with the number of compact faces greater than 1 almost nothing is 
known. The following general problem remains open. 


Problem 14. Describe all finite two-dimensional sails (and the corresponding con- 
tinued fractions). 


The first tasks here are to investigate sails having two, three, four, etc., compact 
faces. 


20.4 On a generalized Kronecker’s approximation theorem 


Kronecker’s approximation theorem states the following: if 0 is an arbitrary irra- 
tional number, then the sequence of numbers {k@} (for k > 0) is dense in the unit 
interval. In this subsection we formulate and prove a multidimensional generaliza- 
tion of Kronecker’s approximation theorem. 

We begin with a preliminary definition of the operation of addition for sets. 


20.4.1 Addition of sets in IR" 


Let S and T be two subsets of IR”. We define the sum of S and T as follows: 
SOT ={p+q|pES,qeET}. 


We show an example of the sum of a triangle and a square in Fig. 20.2. 


The sum operation is invariant with respect to affine transformations. 
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Fig. 20.2. Addition of two convex polygons. 


Proposition 20.11. The sum of two convex sets is convex. 


Proof. The proof is straightforward. Consider two convex sets S and T. Let u,v € 
S@T. Then u=s; +t; and v= s2 +12, where s1,52 € S and t,,t2 € T. Hence for 
every A € [0,1] we have 


Aut(1—A)v=A(s,4+t)+(1—-A)(s2 +h) = (As, + (1—A)s2) + (At +(1—A)tp). 


The two summands in the last expression are in S and T respectively. Hence the sum 
is in S@T. Hence for any two points of S@T the segment connecting these points 
is also in S@T. Therefore, the set S@T is convex. 


20.4.2 Integer approximation spaces and affine irrational vectors 


Let us give some preliminary definitions. We call a subspace L integer if it contains 
an integer sublattice of full rank in L. 


Definition 20.12. Let u € IR”. We call the intersection of all integer vector subspaces 
of R” containing u the integer approximation space of u. It is denoted by Ry. 


Definition 20.13. A vector u € R” is called affinely irrational if its endpoint is not 
contained in any integer affine hyperplane of R,,. 


Example 20.14. (i) Consider the vector v = (2, V3). The integer approximation 
space of v is two-dimensional; the vector v is affinely irrational. 

(ii) The integer approximation space of (1, V2) is also two-dimensional (i.e., it is 
R?). Nevertheless, this vector is not affinely irrational, since its end is contained in 
the integer line x = 1. 

(iii) Now let u = (/2, V2). Then R, is the line y = x, and uw is affinely irrational. 
(iv) Finally, for the vector w = (1,1) we have that Ry, is the line y = x. The integer 
affine hyperplanes in Ry, are just integer points. The vector w is integer itself, so w 
is not affinely irrational. 


Further, we use the following proposition. 
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Proposition 20.15. Let u € R". Then for every A © R\S, where S is a countable 
set, the vector Au is affinely irrational. 


Proof. The set S is formed by the intersections of the integer affine hyperplanes in 
R, with the line @ = {Au | u € R}. No such integer hyperplane z contains the line 
£, since otherwise, 2 would contain R,; hence 7 intersects @ in at most one point. 
Since there are countably many integer planes, S is countable. 


20.4.3 Formulation of the theorem 


We are interested in the following generalization of the Kronecker’s approximation 
theorem (see [156] and [136]). 


Theorem 20.16. (Multidimensional Kronecker’s Approximation Theorem.) 
Consider an arbitrary vector u in R". 
(i) Let Ly = Ry, OZ". Then Ly, ® {ku | k € R} is dense in Ry. 


(ii) For every x € R, and every € > 0, the set B(x, €) ® {ku | k € R} contains infinitely 
many integer points. 


(iii) For every x € Ry, and every € > 0, the set B(x,€) ® {ku | k € R4} contains 
infinitely many integer points. 


(iv) There exists an integer sublattice T of rank n—dimR, such that 


Z" ® {ku | ke Z} =R, OT. 


(v) If the vector u is affinely irrational, we have that for every x © Ry and every 
€ > 0, the set B(x,€) @ {ku | k € Zs} contains infinitely many integer points. 


Example 20.17. Consider a vector u = [\/2, 3, /2 + V3]. Then the space R,, is a 
plane x; +x2 = x3 and the vector u is affinely irrational. Hence all five statements of 
the theorem hold. 


20.4.4 Proof of the Multidimensional Kronecker’s approximation 
theorem 


In the proof of Theorem 20.16 we use the following lemma. 

Lemma 20.18. Consider u € R” and let dim R, = d. Suppose that the integer lattice 
Ly = Ry NZ" has an integer basis (e1,...,€a). 

(i) For any nonzero A, we have Ry = Ry, 


(ii) Consider an affinely irrational vector u. Let 
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where the numbers Ag,A,,...,A, are rational, and additionally Ag # 0. Then Ry = 
R,, and the vector w is affinely irrational. 

(iii) Let u = (ay,...,aq_1, 1), in coordinates related to the basis e,,...,éq, denote 
i = (a1,...,aq_1,0). Then Ry = Span(e1,...,@q—1) and it is affinely irrational. 


Proof. Lemma 20.18(i) holds, since the linear spaces defined by u and Au coincide. 


Let us prove Lemma 20.18(ii). Let u be affinely irrational and let w be as in 
the statement of the lemma. Let us first prove that Ry = R,. A hyperplane of R, 
containing w is integer if and only if a hyperplane shifted by u—w containing w is 
integer. Hence Ry = R,, and w is affinely irrational. 


Finally we prove Lemma 20.18(iii). Consider an arbitrary integer subspace R 
containing a@. Then the span of R and eg contains u. Hence R,, is in the span of Ry 


and eg. Therefore, Rg coincides with the span of e),...,eg_1. 
Now we prove that @ is affinely irrational by reductio ad absurdum. Suppose that 
it is contained in some integer hyperplane of Span(e1,...,é@q—1) defined by a linear 


equation f = 0. Then u satisfies two independent linear equations f = 0 and xg = 0. 
Hence u is contained in some integer plane 7 of codimension 2 in R,,. Hence u is 
contained in the integer hyperplane Span(z,O) (where O is the origin) of R,, and 
hence R,, is not the integer approximation space of u. We arrive at a contradiction. 
The proof is complete. 


Proof of Theorem 20.16. Let us prove Theorem 20.16 by induction in dimension of 
Ru- 

Base of induction. The statement for the case dimR, = | follows directly from the 
classical Kronecker’s approximation theorem (recall that here u is an irrational num- 
ber). 

Step of induction. Let all the statements of the theorem hold for all vectors whose 
integer approximation space is (d — 1)-dimensional. We first prove statement (v) 
for an arbitrary affinely integer vector u with dimR, = d by reductio ad absurdum. 
Suppose that for some x € R,,, there exists € > 0 such that the set 


B(x,€) @ {ku |keE Zs} 


contains only finitely many integer points. Consider x9 = x + Nu for a sufficiently 
large N such that the set 
B(xo,€) ® {ku | k € Z} 


does not contain any integer point. For a pair of positive integers kj > kz > 0, denote 
by A(k1,k2) the vector 
{xo + kyu} = {xo +kou}, 


where for a vector r = (r),...,q¢), its quotient part is denoted by {r}, ie., 
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Fig. 20.3. Reduction of the dimension. 


{r}=(n—|r],---,ra— ral). 


It is clear that A (k;, kz) belongs to the unit coordinate cube. Since this cube is com- 
pact, there exist integers k, > ky > 0 such that 


A & 3 
\A (ki ,£s)| < ee 


Notice that A (kiko) # 0, since (ky _ ko)u ¢ Z". For every nonnegative m, the ball 
B(xo,€) @ {mA (ky, ky)} 

does not contain integer points, since this ball is obtained from the ball 
B(xo,€) ® { (ki — kyu} 

by a shift on the integer vector 
[xo + ku] — [xo + kau], 

where |r| is the vector floor function, which is equal to r— {r}. Hence the set 

B(xo,€) @ {mA (ky ko) | me Z4} 


does not contain integer points. By construction, this set contains an (€/2)-tubular 
neighborhood of a ray with vertex at xo and direction A(k,,k2), i.e., the set 


T = B(xy,€/2) @ {tA (ky, ko) | teE R,}; 


see Fig. 20.3 (left). Now we are interested in the points of intersections of the ray 
{xo +tA(k1,k2) | k € R;} with the integer planes parallel to the span of the vectors 
€1,---,@q—1- Let the first and second intersections with these planes be yo and yo + v. 
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Then all the intersections are in the set {yo + kv | k € Z,}. By construction, the set 
B(v0,€/2) @{kv |kE Zs} 


does not contain any integer point. Let yp and ¥ be the point and the vector whose 
first d—1 coordinates coincide with the coordinates of yg and v, respectively, and the 
last coordinate is zero. Since the last coordinate of yo and v are integers, the set 


B(¥o,€/2) © {kv | ke Z+} 


does not contain any integer point. 
By Lemma 20.18(ii) it follows that R, = R4(j, ;,)- Further, by Lemma 20.18(i), 
we get R, ae ae R,. Since the last coordinate of R, equals 1, we are in the situation 


of Lemma 20.18(iii). Hence the vector ¥ is affinely irrational and 
Rs _ Span(ey, see ,€d—1)+ 


Hence yo is in Rs. Then by the induction assumption for (d—1)-dimensional integer 
approximation spaces, the set 


B(¥o,€/2) ® {kv|k € Zy} 


contains infinitely many integer points. We arrive at a contradiction. Therefore, the 
statement of Theorem 20.16(v) is true for an arbitrary vector u with dimR, = d. 


Theorem 20.16(iii) is an easy corollary of Theorem 20.16(v): namely, for an arbi- 
trary u take a scalar A such that Au is affinely irrational (due to Proposition 20.15), 
and hence Theorem 20.16(iii) for u follows from Theorem 20.16(v) for Au. 

Theorem 20.16(/) and (ii) follow directly from Theorem 20.16(dii), and Theo- 
rem 20.16(iv) follows from Theorem 20.16(i). 


All these prove the correctness of the step of induction and conclude the proof of 
the multidimensional Kronecker’s approximation theorem. 


20.5 Polyhedral structure of sails 


In this section we study topological and combinatorial structures of sails. We first 
show that all sails are homeomorphic to real spaces. Then we discuss the integer 
polyhedral structure of sails. 
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Zo ___~ 


Fig. 20.4 The intersection of a face F with the closure of the A-hull of C. 


20.5.1 The intersection of the closures of A-hulls with faces of 
corresponding cones 


We start with the following proposition for arbitrary simplicial cones. 


Proposition 20.19. Consider a simplicial n-dimensional cone C in R" with vertex 
at the origin. Let F be a face of C or else F =C. Then we have 


A-hull(C) 0 F = A-hull(F) © F. 


If the dimension of the lattice contained in SpanF equals the dimension of F, we 
have 


A-hull(F) © F = A-hull(F). 


Example 20.20. In Fig. 20.4, we give an example of a three-dimensional simpli- 
cial cone C in R? with a two-dimensional face F containing a one-dimensional 
integer sublattice. The integer points of the face F are shown by black dots. The set 
A-hull(F) is the integer ray containing all the integer points of F’. It is shown dashed 
in the figure. The intersection of the set A-hull(C) with the face F is colored in gray. 


Proof. First, we prove that 
A-hull(F) @ F C A-hull(C) 9 F. 


Let x € A-hull(F’) @ F. This means that x = p+-v, where p is a point of the closure of 
the A-hull of F and v is a vector contained in the face F. If v = 0, then x coincides 
with p, which is in A-hull(F’) C A-hull(C). Suppose now that v 4 0. For every 
positive integer N we choose an integer point py = (a)(N),...,dn(N)) in C\ F 
such that 


v PN=P 1 

lv} |pw— pl] ~ N- 

(It is always possible to choose such point, to be sufficiently far from the origin, 
which we leave as an exercise for the reader.) Then the segment with endpoints p 
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and py is contained in the A-hull of C. Set 


Iv! 


iy = ——— (pn — p)- 
lpn — Pl 


By construction we have 
p+vwn € A-hull(C). 


Then 


v= lim Dy. 
Noo 


Hence 
pt+v=p + lim dy € A-hull(C) NF. 
—oo 


Therefore, we get 
A-hull(F) @ F Cc A-hull(C) 9 F. 


Second, we prove that 


A-hull(C) OF c A-hull(F) © F. 


Let x be not in A-hull(F) @ F. Then there exists a supporting plane P4—! C Span(F) 
of the sum A-hull(F’) © F that separates x and A-hull(F’) @ F (by Proposition 20.11 
the sum A-hull(F’) © F is convex), where d is the dimension of the plane Span(F). 
In addition, this hyperplane can be chosen such that it does not contain any vector 
of F. Then the intersection P’~! 1 F is a (d—1)-dimensional simplex. Denote its 
vertices by vj,...,v¥g. Then, first, the pyramid Ov, ...vg intersects A-hull(F) 6 F 
only in its base vj... vg, and second, the pyramid contains x in the interior. Let us 
choose the vertices vg ...V, on the edges of the cone C that are not adjacent to 
the face F’ sufficiently close to the origin such that all the integer points other than 
the origin (if any) of the pyramid Ov, ...v, are contained in the base v; ...v,. This 
implies that the set A-hull(C) intersects the pyramid only in its base. Since x is not 
in the base v; ... vy, it is not contained in A-hull(C) NF. 
So we have proved the first equality of the proposition. 


Suppose now that the dimension of the lattice contained in F equals the dimen- 
sion of F itself. Let us prove 


A-hull(F) © F = A-hull(F). 


It is sufficient to prove that for an arbitrary point x in the interior of A-hull(F’) and 
vector v € F the ray with vertex at x and direction v is in A-hull(F’). Since the integer 
lattice is of full rank in F’, for any edge e; of F there exist a sufficiently large constant 
Cy, and a sequence x,,(k) of distinct integer points inside F with distances to this 
edge not greater than Cy whose norm tends to infinity. Then, starting from some m, 
the ray with vertex at x and direction v intersects all simplices (%n(1),...,%m(d)), 
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where d = dimF. Hence this ray is in the convex hull of x and all the elements 
Xm(1),..-,;Xm(d) for m > 0. Therefore, this ray is a subset of A-hull(F). 


20.5.2 Homeomorphic types of sails 


The homeomorphic types of sails are described by the following theorem. 


Theorem 20.21. Every sail of an n-dimensional simplicial cone in R” is homeo- 
morphic to R""!. 


Let us prove the following lemma. 


Lemma 20.22. Let C be an n-dimensional simplicial cone in IR" and let x be an 
interior point of its A-hull. Then the shifted cone {x} ®C is contained in the interior 
of A-hull(C). 


Proof. Since x is an interior point of A-hull(C), there exists a ball with center at x 
contained in A-hull(C). Consider a point x; # x in this ball such that x —2x, €C. 
Then the cone {x} @C is in the interior of the cone {x;} @C. By Proposition 20.19 
in the case of F = C we have 


A-hull(C) = A-hull(C) NC = A-hull(C) @C. 


Hence, 
{xi} @C Cc A-hull (C). 


Therefore, the cone {x} @C is contained in the interior of the A-hull of the cone C. 


Proof of Theorem 20.21. Let C be a simplicial n-dimensional cone in R”. Consider 
an arbitrary interior point x of the A-hull of C and take an (n — 1)-dimensional 
sphere S, with center at x that is completely contained in the A-hull. Denote by 
C, the closed cone C @ {x}. Project the sail to the sphere S, along the radial rays 
with vertex x. Let us show that the projection is bijective with the set 5, \C, (see 
Fig. 20.5). 

First, by Lemma 20.22 the cone C, is in the interior of the A-hull of C, and hence 
there are no boundary points of A-hull (i.e., of the sail) that project to S, C,. 

Second, we show the surjectivity. Each ray with vertex at x and intersecting S, \ 
C, intersects the complement to the cone, which is not in the closure of the A-hull. 
Hence this ray contains at least one point of the boundary of the A-hull (i.e., of the 
sail). Therefore, the projection of the sail to the set S, \ C, is surjective. 

Finally, we prove the injectivity. Consider a ray r with center at x and intersecting 
the set S, \C,. Let x, be the last boundary point of the closure of the A-hull, i-e., all 
the other points of the closure of the A-hull are in between x and x, (see Fig. 20.5). 
Denote by P, the convex hull of the set {x,} US, The set A-hull(C) is convex and 
contains the point x, and the sphere S,.. Therefore, 
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i 


Ly 


Fig. 20.5 The set S, \C, and a point x,. 


P. C A-hull(C). 


Hence all points of the ray r between x and x; are interior points of the set A-hull(C), 
and hence they are not on the sail (which is the boundary of A-hull(C)). Therefore, 


d(A-hull(C)) Ar = {x}. 


Therefore, the projection is injective. 

We have shown that there is a one-to-one projection sending the sail to S, \ Cy. 
On the one hand, the projection of the sail is continuous. On the other hand, the 
inverse to the projection is also continuous (this follows directly from the convexity 
of the sail; we leave this as an exercise for the reader). The set S, \C, is homeomor- 
phic to an open (d — 1)-dimensional disk (and hence homeomorphic to R¢~!), and 
therefore, the sail is homeomorphic to R¢~!. 


20.5.3 Combinatorial structure of sails for cones in general 
position 


In this book we mostly discuss sails for cones that are either integer (i.e., each edge 
has an integer point distinct from the origin) or in general position. The combina- 
toric structure of the sails for integer cones is rather simple and was discussed in 
Section 20.3 above. Let us examine the situation for sails in general position. 
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20.5.3.1 Theorem on closeness of sails for cones in general position 


First, we give the definition of a cone in general position. 


Definition 20.23. A simplicial n-dimensional cone in R” is said to be in general 
position if 

— none of its faces contains integer points; 

— each of its edges contains a vector whose integer approximation space coin- 
cides with R”. 


It turns out that for cones in general position the set A-hull(C) is closed. 


Theorem 20.24. Let a simplicial n-dimensional cone C be in general position. Then 
the set A-hull(C) is closed. 


We begin the proof of Theorem 20.24 with two lemmas. 


Lemma 20.25. Let a simplicial n-dimensional cone C be in general position. Then 
A-hull(C) NAC = 90. 
Proof. Consider an arbitrary face F of the cone C. From Proposition 20.19 we have 


A-hull(C) 0 F = A-hull(F) © F =0@F =0. 


Hence the intersection of A-hull(C) with the union of all faces is empty as well. 


Lemma 20.26. Let C be a simplicial n-dimensional cone and let F be a face of 
the sail of C. Consider an interior point p of C contained in F. Suppose that ™ 
is a supporting plane of the set A-hull(C) passing through p. Then we have the 
following: 

(i) If TNC is not compact, then C is not in general position. 

(ii) If AC is compact, we have p € A-hull(F). 


Proof. Lemma 20.26(i). We prove the statement by contradiction. Assume that C is 
in general position. The hyperplane 7 divides the cone into two connected compo- 
nents, one of which, say C;, does not contain integer points in the interior (since it is 
a supporting plane of the of A-hull(C)). By construction, the set C; is not compact, 
and hence its intersection with one of the edges of the cone C is a ray. We assume 
that this edge contains a vector u whose approximation space is IR”. Now consider 
any interior point x of the set C). On the one hand, there exists a positive € such that 
the tubular neighborhood of the edge shifted to x: 


B(x,€) @ {ku|k € Rs}, 
is completely contained in C;. Therefore, by construction, 


B(x,€) @ {ku|kER,}NZ" =O. 
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On the other hand, by the multidimensional Kronecker’s approximation theorem 
(Theorem 20.16(iii)), this tubular neighborhood contains infinitely many integer 
points. We arrive at a contradiction. Hence Lemma 20.26(i) holds. 


Lemma 20.26(ii). Let #MC be compact. Consider a sequence of points p; € 
A-hull(C) that converges to p. By definition, A-hull(C) is the convex hull of a subset 
of integer points. Hence by Carathéodory’s theorem, for every i there exists an inte- 
ger simplex A; C A-hull(C) containing the point p;. Since 2 is a supporting plane, 
the distance from one of the vertices of A; to the plane 7 is smaller than |p — pj|. 
Since Z” is discrete and 7 MC is compact, there exists N such that for every i > N 
every simplex A; has an integer vertex in 7MC. Since 7MC is compact, there are 
only finitely many integer points in 7C. Hence we can choose an infinite subse- 
quence /; in the sequence p; all of whose simplices A; have the same integer vertex 
v, in the plane 7. 

If v; = p, then we are done. In case vj # p, for every A; we can choose a vertex 
w; # v1 such that the distance from w; to the plane z tends to zero as i tends to 
infinity. This is due to the compactness of 7C. Therefore, there exists V such that 
all w; belong to 7C for i > N. Since the set 


(ANC)NZ" 


is finite, we can choose an infinite subsequence Aj of A; whose vertices w '; coincide 
with an integer point of 7, say with vo. 

So each simplex A; contains an integer edge v, v2 in the plane 7. We continue 
iteratively constructing an infinite subsequence of A; with a common face vj... v4 C 
m. Either at some step k we have p € v,... vx, or we reach k = n, and therefore, there 
exists an infinite subsequence of (A;) all of whose elements contain the simplex 
V1 ...Vy, in 7 as a subset, and the last vertex is at positive integer distance from 7. In 
the latter case, 


PD EVI...Vn =V1.--Vn- 


Therefore, the point p is in A-hull(C). 


Proof of Proposition 20.24. Consider an arbitrary point p of the sail. As we know 
from Lemma 20.25, p is an interior point of the cone C. Let 7 be a supporting 
plane containing p. By Lemma 20.26(i), since C is in general position, the set 7C 
is compact. Then by Lemma 20.26(ii) we have p € A-hull(F’). Therefore, the set 
A-hull(F’) contains its boundary. 


20.5.3.2 Structural theorem on cones in general position 


Let us give a classical definition of extremal points and extremal rays. 


Definition 20.27. A vertex in a convex set is said to be extremal if it is not contained 
in the interior of any segment in the set. 
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An edge (or a ray) contained in a convex set is said to be extremal if every segment of 
this convex set either does not intersect the ray (edge) or at least one of its endpoints 
is in the ray. 


Remark 20.28. Notice that not every extremal vertex of a convex set S is a vertex 
of S. For example, consider the union of the half-disk {x* +? < 1 | y > 0} and the 
square with vertices (1,0), (1,—2), (—1,—2), and (—1,0). Here the points (+1,0) 
are extremal vertices and yet they are not vertices (i.e., 0-dimensional faces). 


Now we formulate the main structural theorem on cones in general position (see 
also [136]). 


Theorem 20.29. Let a simplicial n-dimensional cone C be in general position. Then 
we have 


(i) all faces of the sail for C are compact integer polyhedra; 
(ii) the set of all vertices of the sail is discrete; 
(iii) the sail does not contain rays; 


(iv) each vertex of the sail is adjacent to only finitely many extremal edges. 


Proof. We begin with Theorem 20.29(iii), proving it by contradiction. Let a cone C 
satisfy the conditions of the theorem and let its sail contain a ray r. Consider any 
support hyperplane of A-hull(C) containing this ray. This hyperplane divides the 
cone into two connected components, one of which, say C;, does not have integer 
points in the interior (since it is a supporting plane of A-hull(C)). By construction, 
C, is not compact, and hence by Lemma 20.26(i) the cone C is not in general posi- 
tion. We have arrived at a contradiction. Hence Theorem 20.29(iii) holds. 


Let us prove Theorem 20.29(i). Suppose, to the contrary a face F’ is not compact. 
Hence there exists a sequence of points p; in F, that increase in norm to infinity. 
Consider the sequence of points p; where 


iss 1Pi 
p= pit P1\Pi 
|P1Pil 


All the elements of this sequence are points in the unit sphere centered at p;. Since 
the sphere is compact, there exists at least one accumulation point p in the sphere. 
Hence the ray with vertex at p, and direction pjp is entirely contained in F = 
F. Therefore, by Theorem 20.29(iii) the cone C is not in general position. This 
contradicts the condition of the theorem. Hence the face F is compact. 

By Proposition 20.8 the face F’ is the convex hull of a subset of Z”. Compactness 
of F implies the finiteness of Z’ F. Therefore, F' is a compact integer polyhedron. 
This concludes the proof of Theorem 20.29(i). 


Let us prove Theorem 20.29(iv). Suppose the statement is false, and a vertex p is 
adjacent to an infinite number of extremal edges. Since Z” is discrete, the length of 
such edges is not universally bounded. Hence the closure of the union of all these 
extremal edges contains a ray. Since all edges are extremal, this ray is in the sail. 
This contradicts Theorem 20.29(iii). 
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Finally, Theorem 20.29(ii) holds, since the integer lattice is discrete. 


20.5.4 A-hulls and quasipolyhedra 


To conclude this section we mention just one result on the quasipolyhedral structure 
of sails. 


Definition 20.30. A convex set in R@ is called quasipolyhedral if its intersection 
with every polytope is polyhedral. 


In some of the literature, quasipolyhedral sets are called generalized polyhedra. 
They were studied for the first time by V.L. Klee in [118] in the framework of 
separation theory. Let us list several properties describing quasipolyhedral sets. 


Theorem 20.31. A convex set P is quaspolyhedral if and only if the following three 
conditions are satisfied: 

— P is closed; 

— the set of extremal points is locally finite; 

— for every extremal point p, the set of all edges and extremal rays meeting at p 
is finite. 


Let us apply this theorem to sails. 


Corollary 20.32. Consider a simplicial n-dimensional cone C in R". The closure 
of the A-hull of C is a quasipolyhedral set if every only if any face of C containing 
a nonzero integer point spans an integer affine space (i.e., a space with full-rank 
integer sublattice in it). 


For more information we refer to [136]. 


20.6 Two-dimensional faces of sails 


We conclude this chapter with a collection of some results and questions on two- 
dimensional faces of continued fractions. We distinguish two essentially different 
situations: questions related to faces that are a unit integer distance from the origin, 
and questions related to faces that are integer distance from the origin greater than 
one. 


20.6.1 Faces with integer distance to the origin equal one 


All two-dimensional faces of the sails of continued fractions are convex integer 
polygons. If the integer distance to the origin is 1, then the only restriction for a 
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polygon P to be a face of an n-dimensional continued fraction is whether it is pos- 
sible to inscribe P in some (n+1)-gon Q (not necessarily integer!) such that the 
convex hull of all integer points in Q coincides with P. 


Proposition 20.33. Every n-gon is realizable as a face of an m-dimensional contin- 
ued fraction ifm >n—1. 


Proof. It is clear that every integer n-gon P can be inscribed in an m-gon Q, for 
m >n, such that the convex hull of all integer points in Q coincides with P. We 
leave the details of the proof to the reader. 


So the main question arises when m <n—1. 


Problem 15. Which n-gons are not realizable as faces of an m-dimensional contin- 
ued fraction? 


The answer to this question is not understood even in the case m = 2. 
Let us introduce a family of examples of quadrangles that are not realizable as 
faces of two-dimensional continued fractions. 


Proposition 20.34. For arbitrary integers b > a> 1, the quadrangle with vertices 
(—1,0), (—a—1,1), (—1,2), (b—1,1) cannot be a face of a two-dimensional con- 
tinued fraction. 


Proof. Suppose that the statement is false. Let there exist a two-dimensional con- 
tinued fraction that has one of the compact faces (say F’) integer equivalent to a 
quadrangle with vertices (—1,0), (—a—1,1), (—1,2), (b—1,1) for some integers 
b>a-2= 1. Let 7 be the plane of the face. Let us introduce integer coordinates on 
the plane z such that the coordinates of the vertices of the face F are exactly (a,0), 
(0,1), (—b,0), and (0,—1). Notice that a point in this plane is integer (as a point in 
>) if and only if it is integer in the new coordinates in 7. 

Every two-dimensional simplicial cone defining a two-dimensional continued 
fraction has exactly three faces of dimension 2. So the intersection of the edges of 
the cone with the plane 7 is a triangle, which we denote by T. Since F is a face 
of the corresponding continued fraction we have, first, that the face F is contained 
in the interior of the triangle T and, second, that the set T \ F does not contain any 
integer points. Notice that the points (1,1) and (1,—1) are not in F, and the point 
(1,0) is in F. In addition, the points (1,0), (1,1), and (1,—1) are contained in a 
straight line. This implies that the open angle (denoted by ©) with vertex O(0,0) 
and edges passing through the points (1,1) and (1,—1) respectively, contains at 
least one vertex of the triangle T (see Fig. 20.6). 

For two adjacent angles to @ and for the opposite angle @, a similar statement 
holds. Therefore, each of these four nonintersecting angles has a vertex of T, which 
is impossible, since T is a triangle. We have arrived at a contradiction. 


All the questions discussed in this subsection are related to the two-dimensional 
case. Similar questions are also interesting for the multidimensional case as well, 
though it is clear that the complexity rises with the dimension. 
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Fig. 20.6 One of the vertices of T is in the shaded (open) angle. 


20.6.2 Faces with integer distance to the origin greater than one 


We showed in the previous section that every two-dimensional face is realizable as 
a face of some continued fraction (probably of a large dimension). It turns out that 
the majority of such faces are a unit integer distance from the origin. There is a 
complete description of all integer faces that occur at integer distance greater than 
one. 


Theorem 20.35. Every two-dimensional bounded face an integer distance greater 
than one from the origin forms a multistory completely empty pyramid. 


Therefore, the list of such faces is a sublist of all multistory two-dimensional 
continued fractions (see Theorem 19.5). 


Corollary 20.36. (i) The list of faces at integer distance greater than | for k- 
dimensional continued fractions for k > 2 coincides with the list of bases of mul- 
tistory marked pyramids. 


(ii) If k = 2, the list of all admissible faces coincides with the list of bases of trian- 
gular multistory marked pyramids. 


Proof. Existence of all admissible faces follows directly from Theorem 19.5 and 
Proposition 20.33. 

Quadrangular faces are not realizable in the two-dimensional case due to Propo- 
sition 20.34. 


We conclude this chapter with the following open question. 


Problem 16. Which three-dimensional polytopes are realizable as three-dimensional 
faces of sails at integer distance greater than 1. 


20.7 Exercises 


Exercise 20.1. Prove the one-dimensional Kronecker’s approximation theorem: if 
6 is an arbitrary irrational number, then the sequence of numbers {k@} (for k > 0) 
is dense in the unit interval. 
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Exercise 20.2. Construct the sail for the cone with vertex at the origin and generated 
by integer vectors (1,1,1), (—1,2,4), and (2,1,4). 


Exercise 20.3. (a) What is the sum of a round disk and a quadrangle? 

(b) Suppose the sum of a convex k-gon and a convex /-gon is an m-gon. For which 
triples (k,/,m) is this possible? 

(c) Suppose the sum of a convex k-dimensional polyhedron and an /-dimensional 
polyhedron is an m-dimensional polyhedron. For which triples (k,/,m) is this pos- 
sible? 


Exercise 20.4. Let C be a simplicial cone with the vertex at the origin and let F be 
one of its faces. Consider a point p in F and a vector v of F.. Prove that for every 
positive integer N there exists an integer point py = (a1(N),...,@n(N)) inC\ F such 
that ; 
v = 

PN—P Dee 
ly] |pw—pl| N 
Exercise 20.5. Every n-gon is realizable as a face of an m-dimensional continued 
fraction if m>n—1. 


Exercise 20.6. For an arbitrary construct an n-gon that is realizable as a face of a 
two-dimensional continued fraction. 


| ® 
Check for 
updates 


Chapter 21 
Dirichlet Groups and Lattice Reduction 


Let us study the structure of Dirichlet groups, which is actually the main reason for 
the periodicity of algebraic multidimensional sails (see Chapter 22). The simplest 
case of two-dimensional Dirichlet groups was studied in Chapter 12 in the first part 
of this book. In this chapter we study the general multidimensional case. 

We start with Dirichlet’s unity theorem on the structure of the group of units in 
orders in Section 21.1. Further, in Section 21.2 we describe the relation between 
Dirichlet groups and groups of units. In Sections 21.3 and 21.4 we show how to 
calculate bases of the Dirichlet groups and positive Dirichlet groups respectively. 
Finally, in Section 21.5 we briefly discuss the LLL-algorithm on lattice reduction 
which helps to decrease the computational complexity in many problems related to 
lattices (including the calculation of Dirichlet group bases). 


21.1 Orders, units, and Dirichlet’s Unit Theorem 


In this subsection we give some preliminary material on the field theory related to 
the structure of the units in orders. 


Definition 21.1. Let K be a field of algebraic numbers and [,...,[m an ar- 
bitrary finite sequence of numbers in K. The set M of all linear combinations 
City +:+:+ mlm with integer coefficients c),...,Cm is called a module of the field 
K. The numbers [),..., [4 are called the generators of the module M. 


Every module ™ in an algebraic field contains a basis, i.e., a sequence [1,..., Ln 
that generates M as linear combinations with integer coefficients such that only the 
trivial combination represents zero. It is clear that the number of elements in the 
basis coincides with the number of linearly independent elements (over Q) in M. 


Definition 21.2. Let K be a field of algebraic numbers of degree n and let M be a 
module in K. If M contains n linearly independent (over Q) numbers, then it is said 
to be complete, otherwise it is called incomplete. 
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Definition 21.3. A complete module in a field of algebraic numbers K is an order 
in K if it is a ring containing 1. 

Example 21.4. Let K be an algebraic number field. The set of all numbers in K 
whose minimal polynomial has integer coefficients is an order in K. This order is 
often called maximal, since every other order of K is contained in this order. 


Definition 21.5. Let D C K be an arbitrary order. A number € in D is a unit if e~! 


is contained in D. 


Theorem 21.6. (Dirichlet’s unit theorem.) Let K be a field of algebraic numbers 
of degree n = s+ 2t. Consider an arbitrary order D in K. Then D contains units 
€1,...,€- for r=s-+t—1 such that every unit € in D has a unique decomposition of 
the form 


dy 


e=€el!.--&€ ‘ 


where a,,...,@, are integers and € is a root of | contained in D. 


We refer to [26] for a proof of this theorem. 


21.2 Dirichlet groups and groups of units in orders 


Let us describe the relation between Dirichlet groups and groups of units in orders 
in algebraic number fields. 


21.2.1 Notion of a Dirichlet group 


Consider an arbitrary integer matrix A with characteristic polynomial irreducible 
over Q. 


Definition 21.7. Denote by I"(A) the set of all integer matrices commuting with A. 
(i) The Dirichlet group © (A) is the subset of invertible matrices in (A). 


(ii) The positive Dirichlet group £4(A) is the subset of &(A) that consists of all 
matrices with positive real eigenvalues. 


The matrices of (A) form a ring with standard matrix addition and multiplica- 
tion. (As a group, I (A) is isomorphic to Z"*!.) 


21.2.2 On isomorphisms of Dirichlet groups and certain groups of 
units 


Consider an arbitrary matrix A with characteristic polynomial 7,4 irreducible over 
Q. Denote by QJA] the set of all matrices that are polynomials in A with rational 
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coefficients. It is clear that Q[A] is a ring contained in Span(Id,A,...,A”~!). This 
ring is naturally isomorphic to quotient ring Q|x]/7a(x). 

Consider an eigenvalue € of A. It is clear that the field Q[&] is also isomorphic to 
Q(x] /%4 (x). Hence there exists a natural isomorphism 


hae : Q\A] > Q{S], 


sending p(A) to p(&) for every element p of the quotient ring Q[x]/74(x). Since 
= (A) is contained in Q|A], we have an embedding: 


e: E(A) > QS]. 


In the proofs of Proposition 21.8 and Theorem 21.9 we use the following notation: 


ha 


? 


Ps = {p € Q[x]/Za(x) | p(A) € Mat(n,Z)}. 


From the definitions it is clear that 


hy eV (A) = {p(S) | PE Pat. 


Proposition 21.8. Consider an arbitrary integer matrix A with characteristic poly- 
nomial irreducible over Q. Let be one of eigenvalues of A. Then the set hy ¢(I'(A)) 
is an order in Q(&). 


Proof. First, the set P4 described above is closed under addition and multiplication, 
since if p,(A) and p(A) are integer matrices, then (p; + p2)(A) and (p;p2)(A) are 
also integer matrices. Hence hy ¢(I”(A)) is closed under addition and multiplication. 
Since QJA] is a ring (even a field), the addition and multiplication in hy ¢(I"(A)) 
satisfy all ring axioms. Hence hy ¢ (I"(A)) is a ring. 

Second, as an additive group, hg ¢(I'(A)) is a subgroup of Mat(n,Z) isomorphic 
to the free abelian group Z”*", which implies that it is a free abelian group with at 
most n* generators, and hence it is a finite module. 

Third, this module contains the identity matrix Id. 

And finally, the elements Id, €,&?,...,€”"~! are linearly independent over Q, and 
hence the set hy ¢(I"(A)) is a complete module. Therefore, by definition, h4 ¢(I"(A)) 
is an order. 


Theorem 21.9. Consider an arbitrary integer matrix A with characteristic polyno- 
mial irreducible over Q. Let € be one of the eigenvalues of A. Then the Dirich- 
let group =(A) is isomorphic to the multiplicative group of units in the order 
hag (I (A)) of the field Q(§ ). The isomorphism is realized by the map hg ¢ as above. 


Proof. Recall that hy ¢ is a one-to-one map between = (A) and hy ¢(=(A)). 

Letting p(§) < ha g(I'(A))) be a unit, i-e., it is invertible, then there exists an 
element g € P(A) such that p(&)-q(&) = 1. Hence g = p~! in the quotient ring 
Qlx]/Xa(x), and 

p(A)-q(A) = Id. 
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Therefore, the matrix p(A) is invertible in Mat(n, Z), and hence it is in = (A). 
Suppose now that p(A) € =(A) has an inverse in 4 (A) that is q(A). Let us restrict 
these matrices to the eigenspace corresponding to &. We have directly p(&)-q(€) = 
1. Thus, p(&) is a unit. 
Hence, the Dirichlet group =(A) is isomorphic to the multiplicative group of 
units in the order hy ¢(I"(A)), and the isomorphism is given by hy ¢. 


From Dirichlet’s unit theorem (Theorem 21.6) and Theorem 21.9 we have the 
following description of the Dirichlet group 2 (A). 


Corollary 21.10. Consider an arbitrary integer matrix A with characteristic poly- 
nomial irreducible over Q. Suppose it has s real and 2t complex conjugate eigen- 
values. Then there exists a finite abelian group G such that 


E(A)=Goezt!, 


21.2.3 Dirichlet groups related to orders that do not have complex 
roots of unity 


A polynomial is called cyclotomic if its roots are all distinct primitive nth roots of 
unity for some integer n. 

A cyclotomic polynomial is an irreducible polynomial. It is usually denoted by 
®,. These polynomials form only a very small subset of all polynomials. For in- 
stance, for all n > 2, the degrees of ®, are even, hence if we study matrices with 
irreducible characteristic polynomials in odd dimensions, then we do not have any 
orders containing roots of unity other than +1. In fact, the degree equals the Euler 
totient @(n) (the number of positive integers less than or equal to n that are coprime 
to n). The sequence of Euler totients (@(n)) converges to infinity as n tends to infin- 
ity. Hence for each even d there are only finitely many cyclotomic polynomials of 
any fixed degree k. 

If the Dirichlet group =(A) related to orders does not have complex roots of 
unity, then we directly have 


&(A) = (Z/2Z)' ez} 


for some integer k. Since the matrix —Id is in = (A), we have k > 1. 

In these cases, for the positive Dirichlet group, we have =; (A) = Z**'—!, since 
all symmetries about the plane have negative eigenvalues and every subgroup of a 
free abelian group is free abelian. 

In the special case that all the eigenvalues of A are real we have the following 
statement (we call such a matrix a real spectrum matrix). 
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Proposition 21.11. Consider an arbitrary integer real spectrum matrix A € SL(n,Z) 
with characteristic polynomial irreducible over Q. Then =4(A) =Z""}. 


Proof. Every generator of 2,(A) is a matrix with positive eigenvalues. Therefore, 


the operator is not cyclic. Hence 2,(A) is a free abelian group. Since for every 
B ¢ =(A) we have B* € =, (A), the rank of 2, (A) isn—1. 


21.3 Calculation of a basis of the additive group I (A) 


In this subsection we show how to calculate the basis of an additive group (A) 
for an arbitrary integer matrix A with irreducible characteristic polynomial. Let us 
identify the space Mat(n, IR) with the space RR? and consider the standard metrics 
for this space (the distance between two matrices is the Euclidean distance between 
the corresponding points in R”). Then every integer matrix corresponds to some 
integer point of R” In Proposition 21.12 below we show that the group I"(A) is 
isomorphic to Z”*!. Further, we construct a basis of a maximal-rank subgroup in 
I’(A). Finally, we choose a basis in the parallelepiped generated by this basis. So 
the algorithm can be performed as follows. 


Algorithm to calculate a basis of (A). 


Input data. In the input we have an arbitrary integer matrix A with irreducible 
characteristic polynomial. 


Goal of the algorithm. To calculate a basis of the group I'(A). 
Step 1. Since matrices Id,A,A7,...,A” are linearly independent over Q, they form 
a basis of some sublattice of (A). 


Step 2. To reduce all the calculations we apply the LLL-algorithm to the basis of 
Step | to get a reduced basis with much smaller coefficients. 


Step 3. Calculate a basis of I"(A) starting with the basis of a sublattice of (A) 
constructed in Step 2. 


Output. The basis of I"(A). 


21.3.1 Step 1: preliminary statements 
Proposition 21.12. For every integer irreducible real spectrum matrix A the set 
T(A) forms an additive group isomorphic to Z"*", 


Proof. In the diagonal basis, the group of all matrices commuting with A is iso- 
morphic to R”*! by addition. Hence the set of all integer matrices forms an integer 
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lattice in this (n+1)-dimensional subspace. So the group is isomorphic to Z* with 
k<n+l. 

The matrices Id,A,A”,...,A” are linearly independent over Q since the charac- 
teristic polynomial of A is irreducible over Q. Hence, k=n+1. 


Corollary 21.13. The group I'(A) is the intersection of the integer lattice Z'*" C 
Mat(n,R) with the space Span(Id,A,A’,...,A”). 


21.3.2 Step 2: application of the LLL-algorithm 


Applying the LLL-algorithm described below to the sublattice generated by the ma- 
trices Id,A,A?,...,A”, one constructs a reduced basis. This will decrease the Eu- 
clidean lengths of basis vectors generating the sublattice (see Section 21.5 for the 
full-rank lattice algorithm and Remark 21.18 for the non-full rank lattice algorithm). 
In general it is not necessary to use the LLL-algorithm here, so that one can proceed 
directly to Step 3. 


21.3.3 Step 3: calculation of an integer basis having a basis of an 
integer sublattice 


In Step 2 we end up with a basis M),...,M),+1 of a full-rank sublattice in the space 
Span(Id,A,A?, ...,A”). Let us show how to calculate effectively a basis of the inte- 
ger lattice in the span. 

We do this inductively. 


Base of induction. We take M = iM 1 as the first vector of the basis, where A is the 
greatest common divisor of all the element in M1. 


Step of induction. Suppose we have calculated the integer basis M,,...,My_; in 
Span(M,,...,My_1). Let us extend it with M; to a basis of an integer lattice in 
Span(M,,...,M;). As My we choose the only integer vector of the following form: 


1 k-1 


Me=>M.+Y 
k 


Ai— 
—Mi, 
d i=l di 


where d; is the integer distance from the ith point of the set M,,...,M,_1,M, to the 
span of the remaining points, and for integers A; we have 0 < A; < dj. Since 


ld(M,,Span(M,...,My—1)) = 1, 


the matrices M,,...,M, form a basis of the lattice in Span(M,,...,Mx). 
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The distances d; are calculated by the formula of Theorem 18.9, and the inte- 
ger volumes in coordinates are the greatest common divisors of the corresponding 
modified Pliicker coordinates (see Theorem 18.30). The numbers A; are found by 
exhaustive search. 


So in n+1 steps we obtain a basis of I’(A) that is the integer lattice in the space 
Span(Id,A,A’,...,A”) according to Corollary 21.13. 


21.4 Calculation of a basis of the positive Dirichlet group =,(A) 


From the algorithmic point of view this step is the most complicated. We describe 
only the idea for one of the simplest algorithms here and give the corresponding 
references. 

Let 7(x) be the characteristic polynomial of the matrix A and let € be one of the 
roots of 7 (x). Consider the map 


hag: &(A) > QUE] 


as in Section 21.2.2. By Theorem 21.9 this map is an isomorphism between the 
ring =(A) and image hy ¢(=(A)), which is the multiplicative group of units. In 
the book [26] the authors show how to construct a basis for the units of an order 
and a number p such that the norms of all its elements are bounded from above 
by p (here the norm is the standard norm on the order considered as R**‘). The 
method of constructing the constant p is standard and quite technical, so we omit it. 
Now according to [26] we construct a basis by enumeration of all vectors of the set 
h(=(A)) inside the ball Bp(O), where Bp (O) is a p-neighborhood of the origin. The 
preimage (i.e., h~!) of this basis gives us a basis of the group of invertible elements 
in the ring 2(A), and hence it gives a basis of the subgroup 2, (A). 


Remark 21.14. The constant p is extremely large (it equals the exponent of some 
polynomial of the coefficients of the matrix A). An effective algorithm for this step 
can be found in the book [41] by H. Cohen. Using this algorithm one finds the basis 
of units in polynomial time (with respect to the coefficients of the matrix A). 


21.5 Lattice reduction and the LLL-algorithm 


Whenever we are facing computational aspects in terms of the coefficients in some 
lattice basis, it is useful first to choose the basis in a way that it has vectors that 
(in some standard norm) as small as possible and then to perform the calculation. 
In 1982, A.K. Lenstra, H.W. Lenstra, and L. Lovasz in [141] proposed a notion of 
a reduced basis and an algorithm to calculate it. Later the algorithm was named 
the LLL-algorithm in honor of its inventors. We present the original LLL-algorithm 
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below and refer to [27] and [41] for further information. In this subsection we denote 
the Euclidean length of a vector v by |v]. 


21.5.1 Reduced bases 


Let bj,...,b, be a basis for L. Define inductively 


i-1 (b;,b*) 
b; =b;- y Hijd;, where Hij = 75 oe 
=e (05,53) 
(by (v,w) we denote the inner product of two vectors). Actually, the vectors bj,...,b% 


are the resulting vectors in the Gram—Schmidt orthogonalization process (and yj, ; 
are the coefficients that arise in the orthogonalization process). 


Definition 21.15. A basis b;,b2,...,b, for a lattice L is said to be reduced if the 
following two conditions hold: 

(i) Size reduced conditions: for 1 < j <i <a, |uij| < 5. 

(ii) Lovasz conditions: for | <i <n, |b¥ + pj i-1b7_, es Zoe, 

Letting L be a lattice generated by b1,...,b,, denote by d(L) the determinant of 
the lattice: 


d(L) = |det(b1,...,bn)}. 


Let us estimate the lengths of the vectors in a reduced basis via the orthogonal basis 
b* and the determinant of L. 


Theorem 21.16. (A.K. Lenstra, H.W. Lenstra, and L. Lovasz [141].) 
Let by, b2,...,bn be a reduced basis for a lattice L in R". Then we have 
(i) |b;| <2 "BFP forl<j<i<n 


(ii) d(L ) <TH [eu] <2" D/4d(L), 
(iii) by] <2 Aaa), 


Proof. From the size reduced conditions and the Lovasz conditions, we have 


3 1 
bP > (F- 2. ) [bi > _|p* i 
| i| “\4 Hii 1 | i i 5 i il 
for all admissible i, and therefore, 
joy? <2’ J[oe/?, for 1<j<i<n. 


From the size reduced conditions, it follows that 


|bi? = r+ as < bale 2h bE? = (2°? 41/2) < 2° BFP. 


j=l 
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This implies the first item of Theorem 21.16: 
2 j—1 2 i—1 2 
bi S27 BG <2 BF. 


By definition of the basis b},b3,...,%, it follows that 
n 
d(L) = | det(b},b3,...,bn)| =] T1271, 
i=l 


the last equality holding since the vectors b; are mutually orthogonal. Now the in- 
equalities of the second and third items of Theorem 21.16 follow directly from the 
inequalities of the first item and the fact that |by| < |bi]. 


21.5.2 The LLL-algorithm 


One can think of this algorithm as a type of generalization of Euclid’s algorithm. 
We use the following notation. Letting r be the nearest integer to |L;|, denote 
by [b,]; the vector by — rby. 


LLL -algorithm. 


Input data. As input we have a basis of the lattice b;. The additional orthogonal- 
ization data (bj, 1;,;) is defined by b; as above. The algorithm is iterative. At each 
iteration we replace the basis b; by a new one. Whenever we change the basis b; we 
should update the orthogonalization data (b*, uj,;). 


Goal of the algorithm. To calculate a reduced basis of the lattice generated by the 
basis bj. 


Parameters and conditions of the iteration process. An iteration has one parame- 
terk € {1,2,...,n+1}. For the first iteration we put k = 2. The algorithm terminates 
when k =n+1. After each iteration we have the value of k and the following con- 
ditions satisfied: 

(i). For 1 < jf <i<k, |uis| < 5. 

(jig). For 1 <i<k, |b¥ + pii-1b*_,|? > 2b¢_4. 


Description of the iteration. We have the basis b; and the parameter k € {1,2,...,n} 
(if k = n+l, the algorithm terminates). First, if k > 1, we replace by by [bx]x—1 
and respectively update (b?, ui;). Now |Uxx—1| < 5. Second, we choose one of two 
cases to continue. 

Case 1. We choose this case if 


3 
|b} + uie—1bh 4° < q2k-1 and k>2. 
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We interchange by_; and b, and update (b*, u;,;). Finally, we reduce k by one (k := 
k — 1) and exit the iteration. 
Case 2. We choose this case if 


3 
|b? + oii-1B7-1/? > qor-1 r= 1. 


Then we consequently replace by by [b,]; and respectively update (b7, u;,;) for 1 = 
k—1, k—2,...,1. Finally, we increase k by one (k :=k-+ 1) and exit the iteration. 


Output. The algorithm terminates once k = n+ 1. As output, we have a reduced 
basis of the lattice. 


It is important to notice that the algorithm terminates in finitely many iterations. 
The running time is bounded as follows. 


Theorem 21.17. (A.K. Lenstra, H.W. Lenstra, and L. Lovasz [141].) Let L C 
Z" be a lattice with basis by,b2,...,bn, and let B € R,B > 2, be such that |b;|* < 
B for 1 <i<n. Then the number of arithmetic operations needed by the basis 
reduction in the described LLL-algorithm is O(n*logB), and the integers on which 
these operations are performed each have binary length O(nlogB). 


We are not going to give a proof here, since it is quite technical. The interested 
reader is referred to the original manuscript [141]. 
Remark 21.18. The algorithm works also for the case of lattices of rank / <n. In 
this case it terminates when k reaches / + 1. 


21.6 Exercises 


Exercise 21.1. Prove that (A), considered as a group with matrix addition, is 
isomorphic to Z”*!. 


Exercise 21.2. Find all isomorphism types of Dirichlet groups for matrices in 
SL(n,Z) whose characteristic polynomials are irreducible over Q, where 
(a)n = 2; (b) n = 3; (c)n=4. 


Exercise 21.3. Prove that cyclotomic polynomials are irreducible over Q and of 
even degree (except for (x+1)). 


Exercise 21.4. Find the generators and relation of the Dirichlet group = (A), where 


010 010 
@yA=(001]; ®A=[001 
PSA 1-40 


| ® 
Check for 
updates 


Chapter 22 


Periodicity of Klein polyhedra. Generalization of 
Lagrange’s Theorem 


The sails of algebraic multidimensional continued fractions possess combinatorial 
periodicity due to the action of the positive Dirichlet group on the sails. In case 
of one-dimensional geometric continued fractions this periodicity is completely de- 
scribed by the periodicity of the corresponding LLS sequences (see Chapter 11). 
Around twenty years ago, V.I. Arnold posed a series of questions to study peri- 
odicity of multidimensional continued fractions in the sense of Klein. At this mo- 
ment some of his questions have answers, while others remain open. In this chapter 
we discuss current progress in this direction. The questions related to periodicity 
of algebraic sails are important in algebraic number theory, since they are in cor- 
respondence with algebraic irrationalities. In particular, periods of algebraic sails 
characterize the groups of units in the corresponding orders. 

First, we associate to any matrix with real distinct eigenvalues a multidimen- 
sional continued fraction. Second, we discuss periodicity of associated sails in the 
algebraic case. Further, we give examples of periods of two-dimensional algebraic 
continued fractions and formulate several questions arising in that context. Then 
we state and prove a version of Lagrange’s theorem for multidimensional contin- 
ued fractions. Finally, we say a few words about the relation of the Littlewood and 
Oppenheim conjectures to periodic sails. 


22.1 Continued fractions associated to matrices 


We begin with the following constructive definition. 


Definition 22.1. Consider an (n+1) x (n+1) matrix A whose eigenvalues are all 
distinct and real. Take the n-dimensional hyperplanes passing through the origin 
that are spanned by n linearly independent eigenvectors of A. There are exactly n+1 
such hyperplanes. These hyperplanes define the n-dimensional continued fraction 
called the multidimensional continued fraction associated to A. 
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Let us formulate an algebraic criterion of integer congruence of associated mul- 
tidimensional continued fractions. 


Proposition 22.2. Let A and B be matrices of GL(n + 1,R) with distinct real eigen- 
values. The continued fractions associated to A and B are integer congruent if and 
only if there exists a matrix X € GL(n+1,Z) such that XAX~! commutes with B. 


Proof. Let A and B be matrices of GL(n+ 1, R) with distinct real irrational eigenval- 
ues and suppose that their continued fractions are integer congruent. Since the con- 
tinued fractions are integer congruent, there exists a linear integer lattice-preserving 
transformation of the space that maps the continued fraction of A to the continued 
fraction of B. Under such a transformation the matrix A is conjugated by some inte- 
ger matrix X with unit determinant. All eigenvalues of the matrix XAX~! are distinct 
and real (since conjugations preserve the characteristic polynomial of the matrix). 
Since the invariant cones of the first continued fraction map to the invariant cones 
of the second one, the sets of eigendirections for the matrices B and XAX~! co- 
incide. Hence these matrices are simultaneously diagonalizable in some basis, and 
therefore, they commute. 
Let us prove the converse. Suppose that there exists X € GL(n+1,Z) such that 


A=XAX™! 


commutes with B. Note that the eigenvalues of the matrices A and A coincide. There- 
fore, all eigenvalues of the matrix A (just as for the matrix B) are real and distinct. 
Let us consider a basis in which the matrix A is diagonal. Simple verification shows 
that the matrix B is also diagonal in this basis. Hence the matrices A and B define 
the same orthant decomposition of R”’*!, and thus the same continued fraction as 
well. 


22.2 Algebraic periodic multidimensional continued fractions 


Now we formulate the notion of periodicity of continued fractions associated to 
algebraic irrationalities. Recall the following general definition. 


Definition 22.3. An integer matrix A is called irreducible if its characteristic poly- 
nomial is irreducible over Q. 


We write RS-matrix for real spectrum matrices. 


It is clear that every irreducible RS-matrix A has distinct eigenvalues. Hence there 
exists an associated multidimensional continued fraction to A. We work now only 
with integer matrices of the group SL(n+ 1,Z). 


Definition 22.4. An n-dimensional continued fraction associated to an irreducible 
RS-matrix A € SL(n+1,Z) is called an n-dimensional continued fraction of an 
(n + 1)-algebraic irrationality. The case n = 1(2) corresponds to one- (two-) di- 
mensional continued fractions of quadratic (cubic) irrationalities. 
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Consider an irreducible RS-matrix A € SL(n+ 1,Z). From Proposition 21.11 we 
know that the positive Dirichlet group 2,(A) is isomorphic to Z”. Each matrix of 
=, (A) preserves the integer lattice and the union of all n+ 1 hyperplanes, and hence 
it preserves the n-dimensional continued fraction. Since all eigenvalues are positive, 
the sails are mapped to themselves as well. The group =, (A) acts freely on any sail. 
The factor of a sail under such a group action is isomorphic to the n-dimensional 
torus. 


Definition 22.5. By a fundamental domain of the sail we mean the union of some 
faces that contains exactly one face from each equivalence class. 


In Fig. 22.1 we give an example of the periodic continued fraction associated to 
the matrix of the two-dimensional golden ratio (later we study this example in more 
detail; see Example 22.9 below): 


111 
122 
123 


In the top picture we have a fragment of the sail. In the middle there are two triangles 
of two different orbits (white ones and dark ones). Finally at the bottom we show 
one of the possible fundamental domains with respect to the action of the positive 
Dirichlet group. 


22.3 Torus decompositions of periodic sails in R* 


As we have already seen in the first part of the book (Chapter 11), the LLS sequence 
is a complete invariant of one-dimensional sails. The situation in the multidimen- 
sional case is much more complicated, since the combinatorial structure of sails 
is no longer as simple as in the one-dimensional case. A complete invariant is not 
known even for two-dimensional sails in R*. Further, in this section we discuss only 
periodic two-dimensional continued fractions in R°, recalling that such sails are 
homeomorphic to R* by Theorem 20.21. 


Definition 22.6. A class of integer congruences of certain objects in R” is called an 
integer affine type of these objects. 


Let us discuss the case of R?. It is not very hard to get an invariant that distin- 
guishes all sails. For instance, if we know 

— integer affine types of all faces in the sail; 

— integer affine types of the pyramids at all vertices of the sail; 

— additional data about one vertex vo: integer affine type of the union of the 
pyramid and the origin, 

then we can uniquely reconstruct the sail, if it exists. 


Let us briefly show how to do this. 
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Algorithm of sail reconstruction. 


Input data. As mentioned above. 


Goal of the algorithm. To construct the sail (actually some of its finite part, say a 
certain fundamental domain for periodic sails). 


Step 1. From the third condition we construct the pyramid at vo of the sail. 
Step 2. From the first condition we find all polygonal faces adjacent to vo. 


Inductive step. In previous steps we have found several faces of the sail. By the first 
condition identify all integer affine types of pyramids that are adjacent to the con- 
structed set of faces (at least in two edges each). By the second condition construct 
all the faces in new pyramids. 


Output. In a finite or countable number of steps we build the whole sail (up to 
integer congruence). 


Remark 22.7. In the case of algebraic irrationalities all the integer affine types of 
the faces and pyramids would repeat periodically. Hence we need to store only a 
finite amount of data regarding a single period. 


This data set is far from forming a complete invariant of sails. It has many mon- 
odromy conditions, so the majority of cases are nonrealizable. So the natural ques- 
tion here is as follows: how to find whether a certain data set is realizable? 

Recently, V.I. Arnold proposed a weaker invariant to distinguish the periodic 
sails. 


Definition 22.8. A torus decomposition corresponding to a periodic two-dimen- 
sional sail is the factor-torus face decomposition of the sail equipped with integer 
affine types of faces and integer distances to them. 


V.I. Arnold conjectured that torus decompositions of integer noncongruent sails 
are distinct. For all checked integer noncongruent sails this conjecture is true. (See 
Conjecture 18.) Here we should mention that for torus decompositions, the realiz- 
ability problem is not yet studied. 

In the multidimensional case one can define similar torus decompositions. The 
important question is then to find which decompositions are realizable and which 
are not realizable. 


22.4 Three single examples of torus decompositions in R° 


In this and the next section we study several examples. We construct one of the sails 
for each example. 


Example 22.9. We start with the simplest example that generalizes the regular con- 


tinued fraction corresponding to the golden ratio ees Recall that the geometric 
continued fraction corresponding to the golden ratio is associated to the matrix 
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11 
(12) 

All four one-dimensional sails of the one-dimensional continued fraction corre- 
sponding to the golden ratio are integer congruent. The associated continued fraction 
has sails whose LLS sequences are (...,1,1,1,...). This sequence has period (1), 
which is the simplest possible for one-dimensional sails. The corresponding circle 
decompositions consist of one vertex and one edge. 

The generalization of this one-dimensional continued fraction to the multidimen- 


sional case was given by E.I. Korkina in the work [129]. The multidimensional con- 
tinued fraction associated to the matrix 


1il---1 
122---2 
123.--3 
123---k 


is called the generalized golden ratio. 


Remark 22.10. Unfortunately, the generalized golden ratios do not always give the 
simplest periodic continued fractions in all dimensions, since the characteristic poly- 
nomials are not always irreducible. For instance, there are nonconstant factors if the 
dimension is k = 4, 7, 10, 12, 13, 16, 17, 19 for k < 20. Further, we study the case 
k = 3, which corresponds to an irreducible matrix. The case of the simplest matrices 
for k = 4 was studied in [99]. 


Consider the continued fraction associated to the generalized golden ratio matrix 


111 
M={122 
123 


The torus decomposition corresponding to this matrix is homeomorphic to the fol- 


lowing one: 
D C 


A B 


Here the segment AB is identified with the segment DC and the segment AD with the 
segment BC. In the picture we show only homeomorphic types of the faces (without 
lattice structure). The integer affine types of the corresponding faces are given in the 
next picture; both triangles have integer affine types of the simplest triangle: 
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N x 
A B B C 


The positive Dirichlet group =,(M) is generated by the two matrices 


111 101 
122 and 021 
123 112 


The integer distance from the triangle ABD to the origin equals 2, and from the 
triangle BCD it equals 1. 

This torus decomposition was found by E.I. Korkina [129], G. Lachaud [136], 
and A.D. Bryuno and V.I. Parusnikov [31] approximately at the same time. 


Example 22.11. The second example was studied by A.D. Bryuno and V.I. Parus- 
nikov [31]. They constructed the continued fraction that is associated to the follow- 
ing matrix: 
111 
M= 11-10 
100 


The positive Dirichlet group =, (M) is generated by the following two matrices: 
X=M’, Y=2Id—M?’. 


The torus decomposition corresponding to this matrix is also homeomorphic to the 


following one: 
D C 


A B 


Here the segment AB is identified with the segment DC and the segment AD with the 
segment BC. The triangle ABD has the integer affine type of the simplest triangle. 
The triangle BCD has the integer affine type of the triangle with vertices (—1,—1), 
(0,1), and (1,0). 
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The integer distance from the triangle ABD to the origin equals 2, and from the 
triangle BCD it equals 1. 


Example 22.12. The third example was given by V.I. Parusnikov [172]. This con- 
tinued fraction is associated to the following matrix: 


01 0 
M= {00 1 
11-3 


The positive Dirichlet group =,(M) is generated by the following two matrices: 
X=M*, Y=3Id—2mM7!. 


The torus decomposition corresponding to this matrix is also homeomorphic to the 
following one: 


Here the segment AB is identified with the segment DC, and the polygonal line AGD 
with the polygonal line BEC (the point G is identified with the point £). All triangles 
have the integer affine type of the simplest triangle. The pentagon BEF DG has the 
integer affine type of the pentagon with vertices (—1,0), (—1,1), (0,1) (1,0), and 
(1,-1): 


G --+ iN iN 
A B C D C F 


The integer distances from the triangle ABC and the pentagon BEF DG to the origin 
equal 1, from the triangle CDF to the origin equals 2, and from the triangle CFE 
equals 3. 


The continued fractions constructed in Examples 22.9, 22.11, and 22.12 are also 
known as the continued fractions corresponding to the first, the second, and the third 
Davenport forms. Several other single torus decompositions for two-dimensional 
continued fractions were investigated in the works [175], [173], and [174] by 
V.I. Parusnikov. 


For a matrix A we denote by ||A|| the sum of absolute values of all the coefficients 
for the matrix. Then we have the following theorem (for more information see [97]). 
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Theorem 22.13. Let A € SL(3,Z) be an irreducible RS-matrix. 
(i) Al > 4. 
(ii) If ||A|| =5 (there are 48 possible matrices), then the corresponding continued 
fraction is integer congruent to the generalization of the regular fraction for the 
golden ratio. This is shown in Example 22.9. 
(iii) For the case ||A|| = 6 there are 912 different possible matrices. Consider the 
associated two-dimensional continued fractions: 
— 480 continued fractions are integer congruent to the one in Example 22.9; 
— 240 of them are integer congruent to the continued fraction of Example 22.12; 
— 192 of them are integer congruent to the continued fraction of Example 22.11. 


The classification of two-dimensional continued fractions with norm greater than 
or equal to seven is unknown. 


22.5 Examples of infinite series of torus decomposition 


We continue with several infinite series of torus decompositions. The first two in- 
finite series were calculated by E.I. Korkina in [129]. The first of them is shown 
below. 


Example 22.14. The continued fractions of this series are associated to the follow- 
ing matrices for a > 0: 


00 1 
M,=|10-a-—5 
01 a+6 


The positive Dirichlet group =,(M,) is generated by the following two matrices: 
X,=Ma,  Ya=(M,—Id)’. 


The torus decomposition corresponding to M, is homeomorphic to the following 


one: 
D C 


A B 


Here the segment AB is identified with the segment DC and the segment AD with the 
segment BC. The integer distance from the triangle ABD to the origin equals a+ 2, 
and from the triangle BCD it equals 1. 

Both triangles have integer affine types of the simplest triangle: 
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--+ AX 


A B B C 


Several other examples of infinite series of continued fractions were studied 
in [96], some of which we now present. The following series generalizes the one 
from the previous example. 


Example 22.15. This series depends on two integer parameters a,b > 0. Consider 
the continued fractions associated to the following matrices: 


0 1 0 
Map = | 0 0 1 
1 1+a—b —(a+2)(b+1) 
The positive Dirichlet group =,(M,») is generated by the following two matrices: 
Xab=Mz5, Yan =Mz4(Mz4 — (b+ 1)ld). 
The torus decomposition corresponding to M, » is as follows: 


D C 


A B 


Here the segment AB is identified with the segment DC and the segment AD with 
the segment BC. Both triangles have the same integer affine type of the triangle with 
vertices (0,0), (0,1), and (b+ 1,0) (b =5 in the picture): 


The integer distance from the triangle ABD to the origin equals a+ 2, and from the 
triangle BCD it equals 1. 


Remark. If we substitute b = 0 in Example 22.15, then we have a series of matrices 
whose associated continued fractions are integer congruent to the continued frac- 
tions associated to the series of matrices of Example 22.14. We formulate this more 
precisely in the next proposition. 


Proposition 22.16. The continued fractions associated to the following two matri- 
ces are integer congruent (for integers a > 0): 
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0 1 0 00 1 
Mao=|90 0 1 , M,=|10-a-5 
11l+a-—a-—2 01 a+6 


Proof. The matrices (Id —Mao)7! and M/, are conjugate by the matrix X in the 
group SL(3,Z): 


—1-1-2 
X=] 0 0-1 
1 0 -l 


(i.e., M! =X ol (Id — Mao) 'X ). Therefore, the corresponding continued fractions 
are integer congruent by Proposition 22.2. 


Example 22.17. The series of this example depends on an integer parameter a > 1. 
The continued fractions of the series are associated to the following matrices: 


01 0 
Map = | 90 1 
1 a—2a—3 


The positive Dirichlet group =,(M,) is generated by the following two matrices: 
Xa=M7?,  Yg= (2Id—M;z?)"'. 


The torus decomposition corresponding to M, is as follows: 
D C 
A B 
Here the segment AB is identified with the segment DC and the segment AD with 


the segment BC. All four triangles have integer affine types of the simplest triangle: 


ix iN kK ix 
A B D B E B D E 


The integer distance from the triangles ABD, BDE, and BCE to the origin are equal 
toa+2,a+1, and 1, respectively. 


Example 22.18. The series of this example depend on two integer parameters a > 0 
and b > 0. The continued fractions of the series are associated to matrices 


22.6 Two-dimensional continued fractions associated to transpose Frobenius normal forms 307 


0 1 0 
Map = | 9 0 1 
1 (a+2)(b+2)—3 3-—(a+2)(b+3) 
The positive Dirichlet group =, (M,») is generated by 


Xab = ((b+3)Ild—(b+2)Mz5)M,5 and Ya» =M;;- 


The torus decomposition corresponding to this matrix is homeomorphic to the fol- 


lowing one: 
D C 
BC 
F E 
ee 
A B 


Here the segment AB is identified with the segment DC and the polygonal line AFD 
with the polygonal line BEC (the point F is identified with the point £). The integer 
affine types of the faces ABF, CGE, CDG, BDF, and DBEC are as follows: 


The integer distance from the triangles ABF, BFD, CDG, CEG to the origin 
equals 1, 1, 2+2a+2b+ ab, and 3+2a+2b-+ ab respectively. The quadrangle 
DBEC is at unit distance from the origin. 


For further details related to such series we refer to [96]. 
22.6 Two-dimensional continued fractions associated to 
transpose Frobenius normal forms 


Let us consider the family of transpose Frobenius matrices (or transpose companion 
matrices) * 
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where m and n are integers; see Fig 22.2. 


Remark 22.19. Suppose that the characteristic polynomial 7,,, (x) is irreducible 
over the field of rational numbers Q. Then the operator of multiplication by the 
element x in the field Q[x]/(%4,, ,x)) in the natural basis {1,x,x?} has the matrix 
Amn- 


Let us observe some basic properties. Denote by the subset of irreducible RS- 
matrices of this family. 


Proposition 22.20. The set Q is defined by the inequality 


nm — 4m? + 4n> — 18mn —27 < 0. 


In addition it is necessary to subtract all the matrices of types: Aga and Aga+2, 
a € Z (these are the matrices with reducible characteristic polynomials). 


Notice that the set Q has the following symmetry. 


Proposition 22.21. The two-dimensional continued fractions for the cubic irra- 
tionalities constructed by the matrices Am» and A_n,m are integer congruent. 


Remark 22.22. There are some other integer congruences of continued fractions in 
Q. For instance, the matrices Ag_, and A_yq— @. have integer congruent continued 


fractions (since the matrices Ab , and A_,, 2 are integer conjugate). 


Observe that continued fractions of transpose Frobenius matrices do not cover all 
possible integer congruence classes of continued fractions for arbitrary RS-matrices 
in SL(3,Z). For instance, the continued fraction of the following matrix is not inte- 
ger congruent to that of a Frobenius matrix: 


120 
A=|012 
—70 29 


(we discuss similar questions later, in Chapter 25). So the following question is of 
interest. 


Problem 17. How often are continued fractions integer congruent to those of the 
set Q? 


Finally, in Fig. 22.2 we give a table (introduced in [97]) whose squares are filled 
with torus decompositions of the sails for continued fractions associated to the ma- 
trices of Q. Integer affine types of all the faces of the torus decomposition for one of 
the sails for A,,, is shown in the square at the intersection of the row with number n 
and the column with number m. If one of the roots of the characteristic polynomial 
for the matrix equals | or —1, then the corresponding square is marked with the sign 
* or # respectively. The squares that correspond to the matrices whose characteristic 
polynomial has two complex conjugate roots are shaded in light gray. 


Remark. Notice that some of the series constructed above are clearly seen in 
Fig. 22.2. 
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22.7 Some problems and conjectures on periodic geometry of 
algebraic sails 


We begin with a problem on a complete invariant of sails. In the one-dimensional 
case the answer is known, since the LLS sequence is a complete invariant of sails. In 
dimensions two and higher the problem is open (in both the periodic and the general 
cases). For the periodic case we have the following conjecture and open problem. If 
the conjecture is true and the problem is solved, then we have a complete invariant 
for two-dimensional periodic sails. 


Conjecture 18. (V.I. Arnold.) Torus decompositions of integer noncongruent sails 
are distinct. 


Problem 19. (V.I. Arnold.) Describe all torus decompositions that are realizable 
by periodic two-dimensional continued fractions. 


Only a few examples are known in this direction. Let us give several trivial ex- 
amples of torus decompositions that do not correspond to sails. 


Example 22.23. The following torus decomposition is not realizable by sails of 
periodic continued fractions. There is one integer point in the interval AD, which 
we denote by E: 

D F C 


A E B 


Here the polygonal line AEB maps to the polygonal line DFC under one of the 
matrices of the group SL(3,Z) that preserves the sail. Since AEB is a segment, DFC 
is also a segment. Therefore, the points B, C, F, and D lie in the same plane. And 
hence BF is not an edge of some face. 

Now we describe the first nontrivial example of torus decomposition, which was 
given by E.I. Korkina in the work [129]. 


Example 22.24. Consider the simplest torus triangulation. It consists of two trian- 
gles with the simplest integer affine type of the triangle (0,0), (0,1), and (1,0). 


D C 


A B 


The integer distances to both faces are 1. This decomposition is not realizable for 
periodic sails. 
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This example (together with many other ones) allows us to formulate the follow- 
ing conjecture. 


Conjecture 20. For every torus decomposition for the continued fraction of a cubic 
irrationality there exists some face with integer distance to the origin greater than 
one. 


On the other hand, every known (periodic) sail has a face with the integer distance 
to the origin equalling one. 


Conjecture 21. For every torus decomposition for the continued fraction of a cubic 
irrationality there exists some face whose integer distance to the origin equals one. 


The following example is a torus decomposition that consists of a single face. 


Example 22.25. Consider the torus decomposition consisting of one face with in- 
teger affine type of the simplest parallelogram with the vertices (0,0), (0,1), (1,1), 
and (1,0). 

D C 


A B 


The integer distances to this face can be chosen arbitrarily. This decomposition is 
not realizable for periodic sails of cubic irrationalities. 


It seems that a torus decomposition with a single rectangular face is not realizable 
in general. Here we conjecture a stronger statement. 


Conjecture 22. The torus decomposition for every sail of every cubic irrationality 
contains a triangular face. 


The next question is related to continued fractions of the same cubic extension. 


Problem 23. (V.I. Arnold.) Classify continued fractions that corresponds to the 
same cubic extensions of the field of rational numbers. 


Almost nothing is known here (even in the one-dimensional case). For example, 
even the finiteness of the number of possible continued fractions associated to the 
same extension is unknown. (For properties of cubic extensions of rational numbers 
see in the work of B.N. Delone and D.K. Faddeev [58].) Here is a nice example 
showing that a characteristic polynomial does not distinguish two-dimensional con- 
tinued fractions. 


Example 22.26. The following two matrices having the same characteristic poly- 
nomial x? + 11x? — 4x— 1 (and hence the same cubic extension of Q) define integer 
noncongruent continued fractions: 


3 01 0 


A sll 23 00 1 
(=o 14-11 
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22.8 Generalized Lagrange’s Theorem 


The combinatorial topological generalization of Lagrange’s theorem was announced 
by E.L. Korkina in [127]; a complete proof of it was given in 2008 by O.N. German 
and E.L. Lakshtanov in [77]. A slightly weaker algebraic generalization was pro- 
posed in 1993 by G. Lachaud; see [134], [135], and [136]. In this chapter we present 
a version of the combinatorial topological generalization of Lagrange’s theorem. 


Definition 22.27. Let C be a cone and v a vertex of the sail for C. An edge star of v 
is the union of all edges of the sail adjacent to v; we denote it by St,. 
We say that a star is regular if v is in the interior of the cone C. 


Consider a star St, and a point p (not necessary integer) in the complement to 
St,. We say that St, is a regular star with respect to p if 
— vis in the interior of conv(St,, p). 
— the set conv(St,, p) \ (conv(St,) U {p}) does not contain integer points. 
Define 
I’ (St,) = {p|St, is regular star with respect to p}. 


Theorem 22.28. Let St, be a regular star. Then the set I (St,) is bounded. 


Proof. We prove the statement by induction on the dimension of the star. 


Base of induction. The statement clearly holds for every star (i.e., for one point) in 
R!. 

Step of induction. Let the statement hold for every star in R¢~!; we prove the state- 
ment in R?. 

Consider an arbitrary regular star St, in R¢. Suppose I'(St,) is not closed. Then 
there exists a (unit) direction € that is an accumulation points of unbounded direc- 
tions, i.e., for every neighborhood U c S¢~! of & and every N > 0 there exists a 
point p € I'(St,) with direction in U and |vp| > N. Hence the half-prism 


conv(St,) 6 {Ag |A > 0} 


does not contain integer points in the interior except for the points of conv(St,). 
There are two different cases to consider here. 

The first case is v being inside the half-prism. Then either & is rational and the 
cone contains integer points, or & is irrational and then the half-prism contains in- 
teger points by the multidimensional Kronecker’s approximation theorem (Theo- 
rem 20.16). We have arrived at a contradiction. 

In the second case we have v on the boundary of the half-prism. This mean that & 
is in the plane of some face si of St,. Let ZL be the integer hyperplane containing 
Sige: The direction € is also an accumulation points of unbounded directions in 
T(St4—'), since the intersections of pyramids with direction close to € with L are 
also unbounded. This contradicts with induction assumption. 

Hence the set I’ (St,) is bounded. 
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Corollary 22.29. The set I (Sty) contains finitely many integer points. 


Definition 22.30. Let C be a cone in R” and let Sc be its sail. A two-sided infinite 
sequence (v;) of vertices in the sail Sc in R” is called a chain of Sc if for every 
integer k we have 

— vz and vz,1 are connected by an edge of the sail; 

— the points v;,...,Vern—1 are affinely independent (i.e., they span the (n—1)- 
dimensional affine plane). 


Definition 22.31. Let C be a cone in R", Sc its sail, and (v;) a chain in Sc. Consider 
an arbitrary subset U of Aff(n,Z). We say that the chain (v;) is U-periodic if there 
exists a period ¢ such that for every i there exists A; € U satisfying 


Ai(St,,) =(Sty,,,) for j=ii+l,...,i+n. 


Denote by H,, the set of affine transformations Ax + b, where A is an integer 
irreducible RS-matrix in GL(n,Z) (and hence it has distinct eigenvalues). Let H? 
denote the subset of H,, with b = 0. Since we have a fixed basis in R”, we identify 
linear transformations with matrices in the fixed basis. 


Theorem 22.32. (Geometric generalization of Lagrange theorem.) Consider an 
irrational cone C C R" and let Sc be its sail. The following two statements are 
equivalent: 

— there exists a GL(n,Z)-transformation A € H® such that A(C) =C. 

— there is an H,-periodic chain of vertices of the sail Sc. 


Remark 22.33. A stronger similar statement holds for the class of all affine trans- 
formations whose corresponding linear transformations satisfy: 

— all eigenvalues are distinct from 1; 

— all pairs of complex conjugate eigenvalues are distinct and distinct from the 
absolute values of real eigenvalues. 
To avoid technical details we restrict ourselves to the class of affine transformations 
with irreducible RS-matrix in GL(3,Z), i.e., to H,. For more information on the 
general case we refer to [77]. 


Lemma 22.34. Let (v;) be an H,-periodic chain of vertices for some sail S in R". 
Then there exists a transformation A in H® establishing a nontrivial shift of stars, 
i.e., there exists a positive integer T such that for every k we have 


A’ (Sty,) =Sty,7- 


Proof. First, we prove that there exists an affine transformation in H,, establishing a 
nontrivial shift of the union of all the stars along itself. 

Set B; = Aj, Ai. Notice that fork = 1,...,n we have B;(St,,) = St,,, in particular 
B;(vx) = vg. Though it is not clear whether B; acts on the stars as the identity map or 
as a symmetry, nevertheless we know that it preserves the set of all elementary vec- 
tors corresponding to the edges of the stars. Denoting by 7; the sum of all primitive 
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vectors corresponding to the edges of the star (Sty, ), let s; =v; +r). Notice that, 
first, B;(s1) = 51, second, s; is not contained in F. 

Consider now the (n — 1)-dimensional plane spanning the face F and the point 
v, denoted by z. The complement to the plane z contains at least one edge of the 
star St,,. Denote by r2 the sum of all elementary vectors for edges of the star St,, 
in the half-space containing this edge, which we denote by 7. Since B; is a proper 
affine transformation preserving the plane 7, it preserves the half-space 7. Thus it 
preserves the set of all elementary vectors of the star St,, that are in this half-space, 
and in particular, the vector r. Let s2 = vj + ro. We have B;(s2) = 5. 

From all of the above, the transformation B; preserves the n+1 points vj,...,V,—1, 
rj, and rz. By the condition on v; and construction of r; and r2, these points span R”. 
Therefore, B; is the identity transformation and A; = A;+1. So the transformation Aj 
(in H,,) acts on the stars as a shift. 


Let us find now a linear transformation acting on the stars as a shift. For an 
arbitrary integer k, we consider a transformation A;* . This transformation sends St,, 
to St,, and the set of all admissible vertices I'(St,,) to '(St,, ). By Theorem 22.28, 
I'(St,,) is bounded, and since it contains only integer points, I’(St,, ) is finite. Let 
this set have N points. Consider the set 


{Ak(O) |k =0,...,N}. 


All these points are in I’(St,,,), since A‘ (St,,,-~) = St,. Since the stars St,, and St,,, 
are integer congruent, the set '(St,,,) contains exactly N points. Therefore, there 
exist distinct i and j such that Ai(O) = Aj(O). Hence A‘~/(O) = O, meaning that 
A'/ is a linear transformation in H,, and thus in H°. 

The transformation A‘~/ € H? establishes a nontrivial shift of the union of all the 
stars (St,,) along itself. 


Proof of Theorem 22.32. Suppose that there exists an SL(n, Z)-transformation A € 
HP such that A(C) = C. Then for an arbitrary vertex of the sail v the chain (A"(v)) 
with integer parameter n is H,-periodic. 


The converse statement is more complicated. Suppose that there exists an un- 
bounded, in both directions (as an ordered set in IR”), H,-periodic chain of vertices 
(v;) of the sail Sc. Then by Lemma 22.34 there is a transformation A in H? estab- 
lishing a nontrivial shift in the sequence of the stars St,,. Let us prove that A(C) =C. 

Let us consider an (n—1)-dimensional tetrahedron T = v; ...v,. Notice that T 
is in the sail, and hence the hyperspace containing it does not contain the origin 
O. Since A" (T) = Ving1---Vm4n is in C, the tetrahedron T is in the cone A~”(C). 
Let (e;) be an eigenbasis of A such that the corresponding eigenvalues (A;) form a 
decreasing sequence. Denote also by (w;) a basis corresponding to the cone C. 

Let v = dye, +--+ + Qye;, where o% 4 0. Then asymptotically, as m — +-co we 
have 

A™(v) = a%A;"e; +smaller terms; 
A™(v) = a(x) "er + smaller terms. 
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Therefore, the sequences of rays defined by the directions of A”(v) and A~” as 
m — +co converge to the rays with direction e, and e; respectively. (Here to define 
the convergence we consider the topology of the unit sphere. A ray is represented 
by a point of intersection of this ray with the unit sphere.) 

By the above, the set (A”(w;)) should contain the tetrahedron T for every integer 
m. Therefore, the limiting cone at +c also contains T. Since the tetrahedron T is 
(n— 1)-dimensional and the hyperspace of T does not contain the origin, all the rays 
in the limit has linearly independent directions. Therefore, the rays are distinct and 
(possibly after some permutations of vectors e;) the matrix of vectors w; in the basis 
(e;) is lower triangular: 


O11 0 ware 0 0 
O21 O12 °°: 0 0 


On—1,1 An—-1,2 *** A%—1,n-1 0 
On,1 On2 °° Onn-1 Ann 


For the same reasons, all the rays of the limiting cone at —ce are independent. Hence 
for every i there exists a vector w; that is contained in the span of e],...,e;. This 
implies that all the coefficients under the diagonal are zeros. Thus, the matrix is 
diagonal, and all vectors w; are eigenvectors of A. Therefore, A(C) = C (recall that 
by Lemma 22.34 the transformation A is in H®). 


22.9 Littlewood and Oppenheim conjectures in the framework of 
multidimensional continued fractions 


In this section we briefly show a link between the Littlewood and Oppenheim con- 
jectures and a certain conjecture in the theory of multidimensional continued frac- 
tions. Letting v be an arbitrary real number, denote by || v || the distance from this 
number to the set of integers, i.e., 


\|v|| = min |v —&]. 
keZ 


Let us formulate the original conjectures. 


Conjecture 24. (Littlewood conjecture.) For an arbitrary pair of real numbers 
(a, B) the following holds: 


inf_m||mar|||mp || =0. 
meZy. 


Conjecture 25. (Oppenheim conjecture.) Let n > 3. Consider n linearly indepen- 
dent linear forms L),...,Z, in IR”. Suppose that 


22.10 Exercises 315 


inf |Li(p)-Lo(p)+:-Ln 0. 
seo 1(p)-Lo(p)++-Ln(p)| > 


Then there exists an irreducible RS-matrix A € SL(n, Z) such that the planes L;(p) = 
0 for i=1,...,n are the invariant planes of A. 


Remark. It is known that for n = 3 the Oppenheim conjecture implies the Littlewood 
conjecture (see, for instance, [39]). 


Now we reformulate the Oppenheim conjecture in terms of the geometry of sails. 
We start with the following definition. 


Definition 22.35. Let F be an arbitrary (n — 1)-dimensional face of a sail in R” and 
let v],...,Vm be the vertices of F. Then the determinant of F is 


det F = y |det(vi,,...,vi,)]- 


<i, <<in<n 


We also say that the determinant of an edge star St, is the Euclidean volume of 
the convex hull conv(St,). 


Theorem 22.36. (O. German [77].) Let n > 3. Consider n linearly independent 
linear forms L,,...,Ly in R". Denote by C the cone 


C={peER"|L,(p) =0, i=1,...,n}. 


Then the following two statements are equivalent: 
i) inf |L -L DT, > 0; 
(int MEa(p) -La(p)---La() 
(ii) the faces of the sail Sc have uniformly bounded determinants. 


This theorem reduces the Oppenheim conjecture to the following conjecture. 


Conjecture 26. Let the faces and edge stars of a cone C have uniformly bounded 
determinants. Then the sail Sc is algebraically periodic (i.e., it is a sail of some 
irreducible RS-matrix of SL(n,Z)). 


Theorem 22.32 implies that to check periodicity it is enough to find an H,- 
periodic chain of vertices of the sail. 

Finally, we would like to mention here one unsuccessful attempt (which is never- 
theless worthy mentioning) to solve these conjectures in [206] and [207]. For further 
details we refer the interested reader to [75], [76], and [77]. 


22.10 Exercises 


Exercise 22.1. Suppose that a polynomial p with integer coefficients is irreducible 
over Q. Prove that all the roots of p are distinct. 


Exercise 22.2. Check that the Dirichlet groups for Examples 22.9, 22.11, and 22.12 
are correctly calculated. 
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Exercise 22.3. Let A € SL(3,Z) be an irreducible RS-matrix. Prove that ||A|| > 5. 


Exercise 22.4. Prove that the torus decomposition consisting of a single square of 
integer area 2 is not realizable as a sail of a periodic two-dimensional continued 
fraction. 


Exercise 22.5. Consider arbitrary integers m and n. Suppose that the matrix Amy 
is an irreducible RS-matrix. Prove that the matrix A_y—m is also an irreducible RS- 
matrix and that the continued fraction associated to A_y—m is integer congruent to 
the continued fraction associated to Aj, n. 


Exercise 22.6. Consider a cone C and its sail Sc in R”. Let (v;) be some chain of 
the sail Sc. Prove that each n — | consecutive edges of the chain are edges of a face 
in Sc. 
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Fig. 22.1 The sail of an algebraic continued fraction (top); its periodic structure (middle); a fun- 
damental domain (bottom). 
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Chapter 23 
Multidimensional Gauss—Kuzmin Statistics 


In this chapter we study the distribution of faces in multidimensional continued frac- 
tions. The one-dimensional Gauss—Kuzmin distribution was described in Chap- 
ter 13, where we discussed the classical approach via ergodic theory and a new 
geometric approach. Currently, an ergodic approach to the distribution of faces in 
continued fractions has not been developed. In fact, it is a hard open problem to 
find an appropriate generalization of the Gauss map suitable to the study of ergodic 
properties of faces of multidimensional sails. This problem can be avoided, and in 
fact, the information on the distribution of faces is found via the generalization of 
the geometric approach via Mobius measures described in the second part of Chap- 
ter 13 for the one-dimensional case. We discuss the multidimensional analogue of 
the geometric approach in this chapter. 


23.1 M6bius measure on the manifold of continued fractions 


We begin with a description of the structure of a smooth manifold on the set of 
multidimensional continued fractions. Further, we extend the notions of Modbius 
forms and measures to the multidimensional case. 


23.1.1 Smooth manifold of n-dimensional continued fractions 


Denote the set of all continued fractions of dimension n by CF, An arbitrary con- 
tinued fraction is defined by an unordered collection of hyperplanes (71,..., +1). 
Denote by /;, for i= 1,...,2+1, the intersection of all the above hyperplanes except 
for the hyperplane 7;. Obviously, /,...,/,41 are independent straight lines (i.e., 
they are not contained in a hyperplane) passing through the origin. These straight 
lines form an unordered collection of independent straight lines. Conversely, every 
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unordered collection of n+1 independent straight lines uniquely determines some 
continued fraction. 

Denote the sets of all ordered collections of n+1 independent and dependent 
straight lines by FCF, and A,,, respectively. We say that FCF, is the space of n- 
dimensional framed continued fractions. Also denote by S,,+1 the permutation group 
acting on ordered collections of n+1 straight lines. In this notation we have 


FCF, = (RP" x RP" x --» x RP”) \ A, and CF, = FCF, /Sn+1- 
a ae” 
n+l times 


This implies that the sets FCF, and CF,, admit natural structures of smooth mani- 
folds induced by the Cartesian product of n+1 copies of RP”. Note also that FCF, is 
an ((n+1)!)-fold covering of CF,,. We call the map p : FCF, — CF, of “forgetting” 
the order in the ordered collections the natural projection of the manifold FCF, to 
the manifold CF,,. 


23.1.2 Mobius measure on the manifolds of continued fractions 


The group PGL(n+1,]R) takes the set of all straight lines passing through the ori- 
gin of (n+1)-dimensional space into itself (for the definition of PGL(n+1,R) and 
cross-ratios we refer to Chapter 13). Hence, PGL(n+1,R) naturally acts on the man- 
ifolds CF, and FCF,,. Furthermore, the action of PGL(n+1,R) is transitive, i. e., it 
takes any (framed) continued fraction to any other. Note that for any n-dimensional 
(framed) continued fraction, the subgroup of PGL(n+1,R) taking this continued 
fraction to itself is of dimension n. 


Definition 23.1. A form of the manifold CF, (respectively FCF,,) is said to be a 
Mobius form if it is invariant under the action of PGL(n+1,R). 


Transitivity of the action of PGL(n+1,R) implies that every two n-dimensional 
Mobius forms of the manifolds CF, and FCF, are proportional. 


Definition 23.2. Let M be a smooth enough arcwise connected manifold and let @ 
be a volume form on it. Denote by U@ the measure on M defined as follows: for 
every open measurable set § Cc M, 


Definition 23.3. A measure L of the manifold CF, (FCF,,) is said to be a Mébius 
measure if there exists a Mobius form @ of CF, (FCF,,) such that LW = Uo. 


Every two Mobius measures of CF, (FCF,,) are proportional. The projection p 
projects the Mobius measures of the manifold FCF, to the Mobius measures of the 
manifold CF,,. 
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23.2 Explicit formulae for the Mobius form 


Let us write down explicitly the Mébius forms for the manifold of framed n- 
dimensional continued fractions FCF, for arbitrary n. 

Consider R’*! with standard metrics on it. Let 2 be an arbitrary hyperplane of the 
space R"+! with chosen Euclidean coordinates OX) ...X,, and assume that 2 does 
not pass through the origin. We call the set of all collections of +1 ordered straight 
lines intersecting the plane 7 the chart FCF;,,z of the manifold FCF,,. Let the inter- 
section of 7 with the ith plane be a point with coordinates (x1, .++;Xn,) on 1. For 
an arbitrary tetrahedron A;...A,+1 in the plane 2, we denote by Vz(A1,...,An+1) 
its oriented Euclidean volume (with respect to the orientation induced by the coor- 
dinates OX, 1 ...Xn1X1,2-.-Xnv41 of the chart FCF, 7). 


Remark 23.4. The chart FCF; is everywhere dense in (Re)eF!, 
Consider the following form in the chart FCF,,z: 


n+1 n 
A | A 4x) 
i=1 \ j=l 
On (X115-+-Xnnt1) = Va(At,-.Anni Vth 


Proposition 23.5. The measure [q, is a restriction of a Mobius measure to FCF, x. 


Proof. Every transformation of the group PGL(n+1,R) is in one-to-one correspon- 
dence with the set of all projective transformations of the plane 7. Let us show that 
the form @, is invariant under the action of the transformations (of the everywhere 
dense set) of the chart FCF, that are induced by projective transformations of the 
hyperplane 7. 

Let us denote by |¥|z the Euclidean length of a vector ¥ in the coordinates 
OX, 1 ..-Xnn+1 of the chart FCF,,7. Consider an arbitrary point (1,1,...,%n+41) 
of the chart FCF,,,7. Denote by A; = Aj(x1;,-..,%nj) the point depending on the co- 
ordinates of the plane 2 with coordinates (x1j,...,%n,), i= 1,...,n+1. So for the 
chosen point, we have a naturally defined tetrahedron A, ...A,+1 in the hyperplane 
m. Set 


AjAj 
|AjAiln’ 


fii= ij=l,...ntl, iF j. 
It is clear that the vectors f;; and f ;; are parallel to the vector A ;A;. The collection 
(fii) is a basis in the tangent space to FCF;,,z, which depends continuously on the 
point in the chart. By dv;; we denote the 1-form corresponding to the coordinate 
along the vector f;; of FCFy,z. 

Let us rewrite the form @,z in new coordinates: 
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On (X11, ee ied) 


BEE VA Aisne Mpcrt Agcy Aye ) _dvn1 Adv31 A+++ Adan 


-U Al ViAigAe et 
i=1 Tl |A;Ailx x ( 1; ’ n+l) 
k=V ki 
Seal: dv21 Ndvi2_, dv32\dv73,. , @Vntt.nNAVnnt1 
|A1Aa|z |A2A3|z An tAn|z 


As in the one-dimensional case, the expression 


AvjjA Vii 
|AiA j| 


for the infinitesimally small Av;; and Av; is the infinitesimal cross-ratio of the four 
points Aj, Aj, Aj+Avjif ;;, and Aj+Avjjf;; on the straight line A;A;. Therefore, the 
form @, is invariant under the action of the transformations (of the everywhere 
dense set) of the chart FCF, that are induced by the projective transformations 
of the hyperplane 7. Hence the measure HU, coincides with the restriction of some 
Mobius measure to FCF), 7. 


Corollary 23.6. A restriction of an arbitrary Mobius measure to the chart FCF,,7 
is proportional to Le,. 


Proof. The statement follows from the proportionality of any two Mobius measures. 


Let us fix an origin Oj; for the straight line A;A;. The integral of the form dv;; 
(respectively dv ;;) for the segment O;;P defines the coordinate v;; (v;;) of the point 
P contained in the straight line A;A;. As in the one-dimensional case, consider a 
projectivization of the straight line A;A ;. Denote the angular coordinates by @;; and 
(ji, respectively. In these coordinates, 


|A‘Aj|Z 9 4 


dvighdvi 1 (25 ij 
2 


) AQ; \AQji- 


Then the following is true. 


Corollary 23.7. The form @, is extendible to some form @, of F CF,,. In coordinates 
Vij, the form @y is as follows: 


—1)L52!) (etl att see ntl f ntl 
a= Fie TT TT oo? (25%) AA A agijAdgi) }- 


i=l j=i+1 i=1 \j=i+1 
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23.3 Relative frequencies of faces of multidimensional continued 
fractions 


Without loss of generality we choose precisely the form @, of Corollary 23.7. De- 
note by Un the projection of the measure U@, to the manifold of multidimensional 
continued fractions CF. 

Consider an arbitrary polytope F with vertices at integer points. Denote by 
CF,,(F) the set of n-dimensional continued fractions that contain the polytope F 
as a face. 


Definition 23.8. The value (CF,(F)) is called the relative frequency of a face F. 
Let us specify the integer-linear congruence of faces. 


Definition 23.9. We say that two faces of a sail are integer-linear congruent if there 
exists an SL(n, Z)-transformation sending one face to the other. 


Remark 23.10. It some sense the structure of a face includes the origin and the 
polyhedron of this face. Of course, if two faces are integer-linear congruent, the 
corresponding polyhedra are integer congruent. The converse is not true. For in- 
stance, two empty triangles (that are integer congruent) can be integer distance one, 
and integer distance two to the origin. Such faces are not integer-linear congruent. 


Proposition 23.11. The frequencies of integer-linear congruent faces coincide. 
The following problem is open for n > 2. 


Problem 27. Find faces of n-dimensional continued fractions with the highest rel- 
ative frequencies. Is it true that for every constant C there exist only finitely many 
pairwise integer noncongruent faces whose relative frequencies do not exceed C? 
Find the corresponding asymptotics (with respect to C tending to infinity). 


One of the first sets of problems on statistics of faces of multidimensional con- 
tinued fractions was proposed by V.I. Arnold. Let us consider the set of three- 
dimensional integer matrices with rational eigenvalues, denoted by A3. The con- 
tinued fraction associated to a matrix of A3 consists of finitely many faces. Denote 
by A3(m) the set of all the matrices of A3 for which the sums of absolute values of 
all their elements are not greater than m. The number of such matrices is finite. Let 
us calculate the number of triangles, quadrangles, etc., among the continued frac- 
tions, constructed for the matrices of A3(m). As m tends to infinity we have a general 
distribution of the frequencies for triangles, quadrangles, etc. We call the resulting 
frequencies of integer faces the frequencies of faces in the sense of Arnold. The 
problems of V.I. Arnold include the study of certain properties of such distributions 
(for instance, what is more frequent, triangles or quadrangles, what is the average 
number of integer points inside the faces, etc.). The majority of these problems are 
open. For further information we refer to [11] and [12]. 


Remark 23.12. Let us say a few words about the following particular question of 
V.I. Arnold: which faces in two-dimensional sails are more frequent, triangular or 
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quadrangular? For instance, in the case of R?, there are two natural models of ran- 
dom polyhedra. The first one is to choose k random points in space and to construct 
their convex hull. For the second method, one should consider the intersection of k 
random half-spaces. In the first method, a random convex hull has only triangular 
faces, while in the second the average number of vertices of faces is four (Schlafli— 
Saharov theorem). So it is interesting to understand whether two-dimensional sails 
are “closer” to random polyhedra in the first sense (in this case they have more 
triangles) or in the second sense (then they should have more quadrangles). 


The following conjecture connects the notions of relative frequencies and fre- 
quencies in the sense of Arnold. 


Conjecture 28. The relative frequencies of faces are proportional to the frequencies 
of faces in the sense of Arnold. 


This conjecture is open in the n-dimensional cases for n > 2. 


23.4 Some calculations of frequencies for faces in the 
two-dimensional case 


23.4.1 Some hints for computation of approximate values of 
relative frequencies 


Consider the space R* with the standard metric on it. Let 7 be an arbitrary plane in 
IR? not passing through the origin and with fixed system of Euclidean coordinates 
OnXnYqz. Let FCF2,_ be the corresponding chart of the manifold FCF). For an ar- 
bitrary triangle ABC of the plane 2, we denote by Sz(ABC) its oriented Euclidean 
area in the coordinates O7X|Y|X2¥2X3Y3 of the chart FCF,7. Denote by |?|z the Eu- 
clidean length of a vector V in the coordinates O7.XY|X2¥2X3Y3 of the chart FCF) 7. 
Consider the following form in the chart FCF),z: 


dx; Ady, \dx2 \dy2 \dx3 \dy3 
Sa ((x1,91) (x2, ¥2)(x3,y3))3 


x (X1,91,X2,¥2,X3,3) = 


Note that the oriented area Sz of the triangle (x1,y1)(x2,y2)(x3,y3) can be ex- 
pressed in the coordinates ~;, y; as follows: 


Sx ((x1,91)(%2,92)(*3,.93)) = Sle — x2y3 +-x193 —x3y1 $x291 —X1y2)- 

For the approximate computations of relative frequencies of faces it is useful to 
rewrite the form @, in the dual coordinates (see Remark 23.14 below). Define a 
triangle ABC in the plane z by three straight lines /), /2, and /3, where J; passes 
through B and C, /z passes through A and C, and /3 passes through A, and B. Define 
the straight line /; (@ = 1,2,3) in z by the equation (after first making a translation 


23.4 Some calculations of frequencies for faces in the two-dimensional case 325 
of 7 in such a way that the origin is taken to some inner point of the triangle) 
aix+biy=1 
in the variables x and y. Then if we know the 6-tuple of numbers 
(a1,b1,42,b2,a3,b3), 


we can restore the triangle in a unique way. 


Proposition 23.13. In coordinates a,b, ,a2,b2,a3,b3 the form @, can be written 


as follows: 
8da, \ db, \ daz \ dbz \ da3 \ db3 


(a3b2 — anb3 +a1b3—a3b, + arb — ayb)3 - 


So, we reduce the computation of relative frequency for the face F (1. e., the value 
of o(CF;(F))) to the computation of the measure U,(p-'(CF)(F))). Consider 
some plane 7 in R? not passing through the origin. By Corollary 23.7 we have 


Mo, (p-'(CFi(F))) = Har (P'(CF2(F))) 0 (FCF2.x)- 


Finally, the computation should be made for the set Uw, (p~'(CF2(F))) M (FCFi,x) 
in dual coordinates a;,b; (see Proposition 23.13). 


Remark 23.14. In coordinates a;,b; the computation of the value of the relative fre- 
quency often reduces to the estimation of the integral on the disjoint union of a finite 
number of six-dimensional Cartesian products of three triangles in coordinates aj, b; 
(see Proposition 23.13). The integration over such a simple domain significantly in- 
creases the speed of approximate computations. In particular, the integration can be 
reduced to the integration over some 4-dimensional domain. 


23.4.2 Numeric calculations of relative frequencies 


23.4.2.1_ Remark on complexity of numeric calculations 


Explicit calculation of relative frequencies for the faces seems not to be realizable. 
Nevertheless, it is possible to make approximations of the corresponding integrals. 
Usually, the greater the area of the polygon, the smaller its relative frequency. The 
most complicated approximation calculations correspond to the simplest faces, such 
as an empty triangle. 

In Fig. 23.1 we examine examples of the following faces: 

— triangular faces (0,0,1), (0,1, 1), (1,0,1), and (0,0, 1); 

— triangular face (0,2,1), (2,0, 1); 

— quadrangular face (0,0,1), (0,1,1), (1,1,1), (1,0, 1). 
For each face we show the plane containing the face. 
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Fig. 23.1 The points painted in light gray correspond to the points at which the rays defining the 
two-dimensional continued fraction can intersect the plane of the chosen face. 


A point is shaded gray if there exists a continued fraction such that one of the 
rays defining it intersects this plane. The light gray points correspond to the points 
at which the rays defining the two-dimensional continued fraction intersect the plane 
of the chosen face. 


23.4.2.2 Some numeric results 


In Table 23.1 we show the numerical approximations of relative frequencies for 
12 faces. In the column “N°” we write a special sign for integer-linear congruence 
classes of a face. The index denotes the integer distance from the corresponding 
face to the origin. In the column “face” we draw a picture of the integer-linear con- 
gruence class of the face. Further, in the columns “1S” and “ld” we write down the 
integer areas of faces and integer distances from the planes of faces to the origin 
respectively. Finally, in the column “U2” we show approximate relative frequencies 
for the corresponding face. 

Notice that in the given examples the integer congruence classes of polygons and 
integer distances to the origin determine the integer-linear congruence type of the 
face. This is not always the case where we consider greater distances to the origin. 


Remark 23.15. The majority of faces of two-dimensional continued fractions are 
at unit integer distance from the origin. Only three infinite series and three partial 
examples of faces lie at integer distances greater than one from the origin, as seen 
in Chapter 19. If the distance to the face is increasing, then the frequency of faces is 
decreasing on average. The average rate of decrease of the frequency is unknown to 
the author. 
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N° |face IS |Id Lo N° |face IS |Ild Lo 

rf Sf 3|1{1.3990-10-2]] VI; |? y «74 1/3.1558- 10-4 
L <S 3 |3]1.0923-1073|] Vb | ¢ , «| 7/2 /3.1558- 10-4 
I, <f 5|1/1.5001-10-3]] VI, | ¢ ~ [11] 113.4440- 10-5 
108 if 7 | 1]3.0782- 10-4] VIN} * 7 | 1]5.6828- 10-4 
IV, xf 9 | 1]9.4173- 1075] 1X, * |7]1/1.1865- 10-3 
Vv; 11) 1]3.6391-1075]} Xy t} 6 | 1/9.9275- 1074 


Table 23.1 Some numeric results of calculations of relative frequencies. 


23.4.3 Two particular results on relative frequencies 


In conclusion, we give two simple statements on relative frequencies of faces. 


Proposition 23.16. Integer congruent polygons P, and P at integer distances | and 
2 to the origin have the same relative frequencies (see, for example, V1, and V1> of 
Table 23.1). 


Denote by A,, the triangle with vertices (0,0,1), (n,0,1), and (0,7, 1). Denote by 
B,, the square with vertices (0,0, 1), (n,0,1), (m,n, 1), and (0,7, 1). 


Proposition 23.17. The following holds: 


tim #(CFalAn)) 


me (CFB) 


For further information related to multidimensional Gauss—Kuzmin statistics 
we refer to the work [124] by M.L. Kontsevich and Yu.M. Suhov and [101] by the 
author. 
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23.5 Exercises 


Exercise 23.1. Show that relative frequencies of integer-linear congruent faces are 
equivalent. 


Exercise 23.2. Prove Propositions 23.16 and 23.17. 


Exercise 23.3. For every face F of Table 23.1 draw the intersection set of the plane 
spanned by F with all the rays that define two-dimensional continued fractions hav- 
ing F as a face (see examples in Fig. 23.1). 


Exercise 23.4. Calculate up to the second digit the relative frequencies for the faces 
of types VII; and Xj in Table 23.1. 


| ® 
Check for 
updates 


Chapter 24 


On the Construction of Multidimensional 
Continued Fractions 


In the first part of this book we saw that the LLS sequences completely determine 
all possible sails of integer angles in the one-dimensional case. The situation in the 
multidimensional case is much more complicated. Of course, the convex hull algo- 
rithms can compute all the vertices and faces of sails for finite continued fractions, 
but it is not clear how to construct (or to describe) vertices of infinite sails of multi- 
dimensional continued fractions in general. What integer-combinatorial structures 
could the infinite sails have? There is no single example in the case of aperiodic in- 
finite continued fractions of dimension greater than one. The situation is better with 
periodic algebraic sails, where each sail is characterized by its fundamental domain 
and the group of period shifts (i.e., the positive Dirichlet group). 

In this chapter we show the main algorithms that are used to construct examples 
of multidimensional continued fractions (finite, periodic, or finite parts of arbitrary 
sails). We begin with some definitions and background. Further, we discuss one 
inductive and two deductive algorithms to construct continued fractions. Finally, we 
demonstrate one of the deductive algorithms on a particular example. 


24.1 Inductive algorithm 


24.1.1 Some background 


A multidimensional periodic algebraic continued fraction is a set of infinite polyhe- 
dral surfaces (i.e., sails) that contain an infinite number of faces. As we have already 
mentioned, the quotient of every sail under the positive Dirichlet group action is iso- 
morphic to the n-dimensional torus. The algebraic periodicity of the polyhedron al- 
lows us to reconstruct the whole continued fraction knowing only the fundamental 
domain (i.e., the union of faces that contains exactly one face from each equiva- 
lence class with respect to the action of the corresponding group =). Moreover, 
every fundamental domain contains only a finite number of faces of the whole alge- 
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braic periodic continued fraction. Hence we are faced with the problem of finding a 
good algorithm that enumerates all the faces for this domain. 

There was no algorithm for constructing multidimensional continued fractions 
until T. Shintani’s work [200] in 1976. The algorithm was further developed by 
R. Okazaki [166] in 1993. The authors showed the so-called inductive method for 
constructing fundamental domains of multidimensional continued fractions. The al- 
gorithm produces the fundamental domain face by face, verifying that each new face 
does not lie in the same orbit with some face constructed previously. Applying the 
algorithm, one finds the fundamental domain in finitely many steps. In some sense 
this is similar to the classical construction of one-dimensional continued fractions. 

Using inductive algorithms, E. Thomas and A.T. Vasques obtained several fun- 
damental domains for the two-dimensional case in [211]. Later, E. Korkina in her 
papers [128], [129], [130] and G. Lachaud in [134] produced an infinite number of 
fundamental domains for periodic algebraic two-dimensional continued fractions. 


24.1.2 Description of the algorithm 


For a polytope F and its hyperface E we say that the collection (E,F) is a hyper- 
flag. Every (n—1)-dimensional face of a sail for an (n—1)-dimensional continued 
fraction is called a hyperface of the sail. 


Inductive algorithm for constructing the fundamental domains for sails of 
multidimensional continued fractions. 


Input data. We have a matrix A in SL(n,Z) and one of its invariant cones. We start 
with an empty list of hyperflags ZL and empty fundamental domain D. 


Goal of the algorithm. To construct one of the fundamental domains of the contin- 
ued fraction for A in the given invariant cone. 


Step la. Construct a hyperface Fo of the sail of the cone. 


Step 1b. Calculate the face-structure of this face. Add all the hyperflags containing 
Fo to the list Z as unmarked. 
Step 2. Take an unmarked hyperflag (F,£) of the list L and 

— construct the hyperface F’ adjacent to the hyperface F along E’; 

— check whether F’ is not an image of another face F of the list D under the 
action of some element of the positive Dirichlet group 2, (A). If F’ is not an image, 
then add F’ to the list D; 

— mark (F,E) as marked; add (F’, E) as marked. 

Step 3. Until the list Z contains at least one unmarked hyperflag, go to Step 2. 


Step 4. Find the face decomposition of all hyperfaces in the list D. It is the closure 
of one of the fundamental domains. 
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Output. The face decomposition of the hyperfaces in the list D is the closure of one 
of the fundamental domains of an (n—1)-dimensional continued fraction. 


Remark 24.1. The algorithm is relatively slow in Step 2, but the advantage is that 
one does not need to construct the basis of the positive Dirichlet group. If we work 
with nonalgebraic sails, then this algorithm constructs a finite domain of a sail (here 
in Step 2, we only construct an adjacent face). 


24.1.3 Step la: construction of the first hyperface 


24.1.3.1 How to calculate one vertex of the sail 


Consider an arbitrary invariant cone C. Let us find some integer point in C. Shift the 
basis unit parallelepiped completely into the cone C. There exists an integer point 
lying inside the shifted parallelepiped. The coordinates of this point coincide with 
integer parts of coordinates for one of the 2” vertices of this parallelepiped. 

Having found some integer point P of the invariant cone C, let us construct some 
vertex of the sail of C. Consider some integer plane 7 passing through the origin 
such that 

TAC=O, 


where O is the origin. Suppose that the integer distance from the point P to this 
plane is equal to d. Now we look through all the simplices obtained as intersections 
of the cone C with planes parallel to 7 at integer distances to the origin equal to 
1,...,d. Suppose that the first simplex containing integer points lies in the plane at 
integer distance d’ < d. The convex hull of all points of this simplex coincides with 
some faces of the sail. All vertices of this face are vertices of the sail. Choose one 
of them. 


24.1.3.2 Inductive construction of a hyperface 


Suppose that we have already constructed a face Fy of the sail of dimension k—1 
(where k < n). Let us construct a face of the sail of dimension k. Let 2; be an integer 
plane of dimension n—2 that contains Fy, intersects the cone C in a compact set, and 
does not pass through the origin O. Consider a two-dimensional plane 72 orthogonal 
to 7. Let us project the cone and all its inner integer points along 71 to 72, denoting 
this projection by f. The image of the cone f(C) is a two-dimensional cone. Letting 
S be the set of all integer points inside the cone, then f(S) is contained in the cone 
f(C). In addition, f(S) is contained in f(Z”"), which is a two-dimensional discrete 
lattice (since 7 is integer). 

Consider a line / in m2 passing through f(F;) and assume that this line cuts from 
the cone f(C) a triangle that contains points of f(S). Take the convex hull of these 
points in the triangle. One of the two neighboring points of f(S) to f(F;,) in the 
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boundary of this convex hull is in the boundary of the convex hull for f(S) (we call 
this point P). Let s € S and f(s) =P. Then the span of s and the face F; is in some 
face of the sail of dimension k. The face F,.; consists of all integer points in the 
tetrahedral intersection of the plane spanned by F; and s with the cone C. 

So we get the face F,4. Proceed further until we get a hyperface F,,. 


24.1.4 Step 1b, 4: how decompose the polytope into its faces 


Let 7 be an oriented plane of dimension r, and let po, p1,...,p, be points in this 
plane. Define 


1 if popi,.--, Pop, defines a positive orientation, 
sign; (Po, P1;---;Pr) = 4 9 if popi,---,Popr are dependent, 
—1 if popi,-.--, opr defines a negative orientation. 


Proposition 24.2. Consider an r-dimensional convex polytope p\...ps, and sup- 
pose it spans the plane 1 of dimension r. Consider a subset of indices i, < ... < 
i, < s. Let the tetrahedron T = pj, --- pi, have nonzero Euclidean (or integer) (r—1)- 
dimensional volume. Then we have: 

(i) The tetrahedron T is in a hyperface if and only if the function 


sign, (pj, Pi,,--->Di,) 


is either nonnegative or nonpositive simultaneously for all j € {1,...,s}. 


(ii) If T is in some hyperface, then this hyperface consists of points pj; satisfying the 
equation 

sign, (Pj, Pi, gars e ,Di,) = 0. 
Proof. The condition of the first item means exactly that all points are in one half- 


plane with respect to the hyperplane containing the tetrahedron pj, ... pi,. 
The condition of the second item enumerates the points contained in the plane of 


T. 


Remark 24.3. Using Proposition 24.2 one constructs all hyperfaces of a polyhe- 
dron P. Further decomposition into faces is done iteratively, applying the hyperface 
search to already constructed faces. 


24.1.5 Step 2: construction of the adjacent hyperface 


Suppose that we know a hyperface F and one of its hyperfaces EF. Let us construct 
the hyperface Fr adjacent to F via E. 
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Let 7 be an integer plane of dimension n—2 containing EF and a two-dimensional 
plane 7 orthogonal to 7. Let us project the cone and all its inner integer points 
along 7) to 7%, again denoting this projection by f. The image of the cone f(C) is 
a two-dimensional cone. Letting S be the set of all integer points inside the cone, 
then f(S) is contained in the cone f(C), and in addition, f(S) is contained in f(Z”), 
which is a two-dimensional discrete lattice (since 7 is integer). The image of F is 
the segment AB, and the image of the vertex O of the cone is the point denoted by 
O'. Consider a point in f(S) \ O'AB, denoting it by P. 

Without loss of generality, we suppose that the points A and P are in one half- 
plane with respect to the line O’B. The line AP cuts from the cone f(P) a triangle 
containing points of f(S). Take the convex hull of these points in the triangle. One 
of the two neighboring points of f(S) to B in the boundary of this convex hull is 
in the boundary of the convex hull for f(S) (we call this point Q). Let s € S and 
f(s) = Q. Then the span of s and the face E is in a hyperface of the sail adjacent to 
E. The hyperface Fr consists of all integer points in the tetrahedral intersection of 
the plane spanned by E and s with the cone C. 


24.1.6 Step 2: test of the equivalence class for the hyperface F' to 
have representatives in the set of hyperfaces D 


Consider two ordered bases (e!) and (e?) with matrices M and Mp in the standard 
coordinate basis. It is clear that (e}) is integer congruent to (e?) if and only if 


M2(M,)~' € SL(n,Z). 


Here one needs to check that the elements of M>(M,)~! are all integer and that the 
determinant equals 1. 

If in addition we are interested whether these two bases are in the same class 
with respect to some Dirichlet group 2(A), then we should check that the matrix 
M>(M,)~! commutes with the matrix A. 


Suppose that we are willing to test whether the equivalence class of a hyperface 
F' has representatives in the set of faces D. Take an arbitrary n-tuple of independent 
integer points in F’ and the basis (e;) of R” corresponding to it. The test is positive if 
there exist a face in D and an n-tuple of independent integer points in this face such 
that the corresponding basis (g;) is integer congruent to (e;) and the transformation 
matrix commutes with the matrix A defining the cone. 
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24.2 Deductive algorithms to construct sails 


24.2.1 General idea of deductive algorithms 


In inductive algorithms one constructs faces of the sail one by one inductively. There 
is another approach to constructing faces of the sail for the cone C. First, one should 
construct an approximation of the sail itself, and, second, choose the faces of the 
approximation that are the faces of the sail. This method is especially useful in 
the algebraic periodic case. Suppose that we are given an integer irreducible real 
spectrum matrix A € SL(n+ 1,Z). To compute some fundamental domain of a sail 
of the continued fraction associated to A it is sufficient to do the following: 

1. Compute a convex hull approximation of the sail. Namely, take a large enough 
convenient set of integer points and find its convex hull. 

2. Make a conjecture on some fundamental domain. Here we need to guess a set 
of faces that might form a fundamental domain. We do this by finding a repeatable 
pattern in the geometry of the faces. 

3. Prove the conjecture if possible. 

4. If you cannot prove the conjecture, start with 1, but with a larger convenient 
set of points. 


In this situation the following two questions are relevant: 
How can one find a convenient set of integer points for the approximation of the 
sail? 
How can one test whether the conjecture of a fundamental domain of the sail is 
true? 
Let us give answers to these questions. 


24.2.2 The first deductive algorithm 


In [156] J.O. Moussafir developed an essentially different approach to constructing 
continued fractions. It works for an arbitrary (not necessarily periodic) continued 
fraction and computes any bounded part of an infinite polyhedron. The approach is 
based on deduction. One produces a conjecture on the face structure for a large part 
of the continued fraction. Then it remains to prove that every conjectured face is 
indeed a face of the part. This method can be also applied to the case of periodic 
continued fractions. 


The scheme of the first deductive algorithm. 


Input data. A cone C. 


Goal of the algorithm. Construct a large enough set of faces in C. 
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Step 1. Find an approximation of the cone and make a face decomposition (better if 
inside the approximated cone). 


Step 2. Determine which faces are the real faces of the sail. Go to Step | until there 
are enough faces. 


Step 3. In the algebraic case, find the fundamental domain of the sail. 
Output. A large enough set of faces in C. 


In the first step we approximate the cone in the following way. The cone is de- 
fined by its edges, so we take a good approximation of all the edges, and get a cone 
approximating our cone. 


In the second step we should check that certain polyhedra are faces of the sail 
for the cone. So we are solving the following problem: starting with a cone C with 
vertex at the origin and an integer polyhedron F, find out whether F is a face of the 
sail for C. Denote by 7p an integer hyperplane containing F’. 

(i). Check that the intersection of the cone C with ap is a simplex, find all the 
integer points inside it, and check that their convex hull coincides with F. 

(ii). Find the integer distance d from the origin to F. Check that all integer planes 
parallel to zr and having the origin and F in distinct half-planes (there are exactly 
d—1 such planes) do not have integer points in the intersection with the cone C. 


In the third step we find out which faces of the constructed part are shifted by 
matrices of the group £4 (A) to each other. This was discussed in Section 24.1.5. 


As an example, we discuss one particular case of two-dimensional sails later, in 
Section 24.2.4. 


24.2.3 The second deductive algorithm 


In this section we describe another deductive construction adapted especially to 
fundamental domains of periodic continued fractions. The construction involves a 
method for conjecturing the structure of the fundamental domain and an algorithm 
testing whether the conjectured domain is indeed fundamental. Usually the number 
of “false” vertices of this approximation is much smaller than the number of “false” 
vertices of the approximation produced by the first deductive algorithm. 

Note that this algorithm substantially uses the periodicity of a multidimensional 
continued fraction, and hence it is impossible to apply it to aperiodic continued 
fractions. 

Almost all known simple examples and series of examples of fundamental do- 
mains were constructed using this algorithm (see [96]). In particular, the complete 
list of all two-dimensional periodic continued fractions constructed by matrices of 
small norm (| *« | <6) was found in [97] (see also Theorem 22.13 above). 


Let us briefly itemize the main steps of the algorithm. 
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The second deductive algorithm to construct fundamental domains. 


Input data. A matrix A € SL(n, Z) and one of its invariant cones. 


Goal of the algorithm. Construct a fundamental domain for the sail of the given 
matrix A for the given invariant cone. 


Step 1. Calculate the basis of the additive group of the ring I"(A). 
Step 2. Calculate the basis of the group =+(A) (using the result of Step 1). 
Step 3. Find some vertex of the sail (see Section 24.1.3). 


Step 4. Make a conjecture on a fundamental domain of the sail (using the results of 
Step 2 and Step 3). 


Step 5. Test the resulting conjecture from Step 4. 


Output. A fundamental domain of an operator A in the given invariant cone. 


Remark 24.4. It is assumed that the fundamental domain conjectured in Step 4 and 
the basis A;,...,A, of the group 44 (A) satisfy the following conditions: 

(i) the closure of the fundamental domain is homeomorphic to the disk; 

(ii) the operators with matrices A,...,A, define the gluing of this disk to the n- 
dimensional torus. 


We show how to test conjectures in the case of two-dimensional continued frac- 
tions in the description of Step 5. The result is partially based on Corollary 20.36 
on the classification of two-dimensional faces at the integer distances to the origin 
greater than one (see also in [98]). For the case of n-dimensional continued frac- 
tions for n > 3, the last step is quite complicated, since the classification of three- 
dimensional faces at integer distances to the origin greater than one is unknown. 


Remark 24.5. Note that all deductive algorithms are not algorithms in the strict 
sense. One needs to choose some basis of 2+ (A) in the right way, produce a good 
conjecture, and then test it. Even the algorithmic recognition of the period for the 
given picture of a sail approximation is thought to be a hard problem. That is why 
this “algorithm” cannot be completed with certainty by some computer program. 
But on the other hand, this deductive algorithm is effective in practice. All of the 
examples listed in the article [96] were produced using this algorithm. The examples 
of this paper generalize and expand almost all known periods of the sails calculated 
before. 


Remark 24.6. Deductive methods can be naturally generalized to the calculations 
of periods of certain other periodic multidimensional continued fractions related 
to cones (for more information we refer to the works by H. Minkowski [151] and 
G.F. Voronoi [220].) 


General questions concerning the lattice bases (Steps 1 and 2) were discussed in 
Chapter 21 for the calculation of a vertex of the sail (Step 3); see Section 24.1.3. 
Step 5 is general for both deductive algorithms. It will be discussed in the next 
subsection. So it remains to describe Step 4. 
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24.2.3.1 Step 4. How to produce a conjecture on the fundamental domain of a 
sail 


Suppose that we know some vertex V of the sail in the invariant cone (from Step 3) 
and a basis Aj,...,A, for the group 24 (A) (from Step 2). Let us briefly discuss how 
to produce a conjecture on a fundamental domain of the sail. First, we compute the 
set of integer points that contains all vertices of some fundamental domain of the 
sail. Second, we show how to choose an infinite sequence of special polyhedron 
approximations for the sail. Finally, using a picture of these approximations, we 
formulate a conjecture on a fundamental domain of the sail. 


Proposition 24.7. Let V be a vertex of the sail of the n-dimensional continued frac- 
tion of an (n+ 1)-algebraic irrationality. Then there exists a fundamental domain 
of the sail such that all vertices of this domain are contained in the convex hull H of 
the origin and of the 2” distinct points of the form 


n 


where €; € {0,1} for 1 <i<n. 


Proof. Consider the polyhedral cone C with vertex at the origin and base at the 
convex polyhedron with vertices Ve, ¢,. We take the union of all images of this 
polyhedral cone under the actions of the operators with matrices 


for 1 <i<n, where m; € Z. Obviously this union is equivalent to the union of 
the whole open invariant cone and the origin. Therefore, every vertex of the sail is 
obtained from a vertex contained in the cone C by applying an operator with matrix 
Am,,...m, for some integers m;. Moreover, the convex hull of all integer points of the 
given invariant cone contains the convex hull of the vertices of the form Aj, im, (V). 
Hence the sail (i.e., the boundary of the convex hull of integer points) is contained 
in the closure of the complement in the invariant cone to the convex hull of all 
integer points of the form Am, |... (V). This complement is a subset of the union of 
polyhedra obtained from H by an action of some operator with matrix Aj, ....m, (for 
some integers mj, 1 <i<n). 


We skip the classical description of the computation of the convex hull for the 
integer points contained in the polyhedron H. Denote the vertices of this convex hull 
by V, for 0 <r <N. Here N is the total number of such points. 


Definition 24.8. The convex hull of the finite set of points 


{Am,,...m,(Vr) | 1 <mj <m,Vi:0<i<n} 
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is called the nth special polyhedron approximation for the sail. 


The defined set contains approximately, but fewer than, m”N points. (Since we 
calculated some image points for the boundary of H several times, we do not 
know the exact number of points.) The number WN is fixed for the given genera- 
tors A,,...,A, while m varies. We should try to make a conjecture with the least 
possible m. 


Remark 24.9. For all the examples listed in the paper [96] (and for the example 
of the last section) it was sufficient to take m = 2 to produce the corresponding 
conjectures. 


Remark 24.10. The “quality” of the approximation strongly depends on the choice 
of the basis of = (A). 


Remark 24.11. Note that though the algorithm works for n+1 arbitrary linearly 
independent matrices of 2+(A), it is not necessary to find generators of =,(A). 
Suppose that we know matrices A1,...,A, that generate only some full-rank sub- 
group of the group 2, (A). Let the index of this subgroup be equal to k. Then we are 
faced with the following two problems. Firstly, the number N will be approximately 
k times greater than in the previous case. Secondly, one should also find a conjecture 
on generators of the group 2,(A). 


24.2.4 Test of the conjectures produced in the two-dimensional 
case 


In this section we explain how to test conjectures for the case of two-dimensional 
periodic continued fractions (for more information we refer to [103]). The test con- 
sists of seven stages. We prove here that these seven stages are sufficient for the 
verification of whether the conjecture produced is true. The complexity of these 
stages is polynomial in the number of all faces. 


24.2.4.1 Brief description of the test stages and formulation of the main results 


Suppose that we have a conjecture on some fundamental domain D for some sail 
of a two-dimensional periodic continued fraction associated to some integer irre- 
ducible real spectrum matrix A. Let B,, Bz be the basis of the group =,(A). Let px; 
(for k = 0, 1,2) be the number of k-dimensional faces of the fundamental domain D. 
Denote by F; (i = 1,..., pz) the two-dimensional faces, i.e., polygons. All vertices 
and edges adjacent to each face are known. It is also conjectured that the fundamen- 
tal domain D and the basis B,, By» satisfy the following conditions: 

(i) the closure of the fundamental domain is homeomorphic to the two-dimensional 
disk; 

(ii) B, and B>2 define the gluing of this disk to the n-dimensional torus T? (the fun- 
damental domain D is in one-to-one correspondence with this torus). 
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Test of the conjecture. Our test consists of the following seven stages: 

1. test of condition (1); 

2. test of condition (ii); 

3. calculation of all integer distances from the origin to the two-dimensional planes 
containing faces F; and verification of their positivity; 

4. test the nonexistence of integer points inside the pyramids with vertices at the 
origin and bases at F; (here the integer points of faces F; are permitted); 

5. test the convexity of dihedral angles (for all edges of the fundamental domain); 
6. verification that all stars of the vertices are regular; 

7. test whether all vertices of D are in the same invariant cone. 


Let us give several technical definitions related to conditions 5 and 6. 


Definition 24.12. (i) A dihedral angle is called well-placed if the origin is contained 
in the corresponding opposite dihedral angle. 


(ii) Consider an arbitrary polyhedral surface P and its vertex v. The n-star of the 
vertex v in P is the union of all faces of P of dimension not greater than n containing 
v. 


Let us now define the regular stars at vertex v of a fundamental domain. First, we 
construct a 2-star R, of the conjectured sail using the algorithm of sail reconstruc- 
tion (see page 300). Second, let p : W —> T” be the universal covering of the torus 
obtained by gluing the fundamental domain via the shift operators. Take any vertex 
w € W corresponding to the vertex v. Denote the 2-star at w € W by R,,. Denote the 
canonical projection of Ry to Ry by &. 

If v £ (a,0,0) for some positive a, then we set ¥,, = (1/m,0,0) for m € Z.. (If 
v = (a,0,0), then we set ¥, = (0,1/m,0).) A 2-star at v is called regular if for the 
sequence of rays J, with vertex at the origin and passing through the point v+ V,, 
there exists a positive k such that for every m > k the following holds: the set 


E"(Im Ru) 
consists of exactly one point. 


Let us first formulate a theorem on complexity of the conjecture test. 


Theorem 24.13. The described conjecture test for the fundamental domain D re- 
quires every than 
C(po + pi + p2)* 


additions, multiplications, and comparisons of two integers, where C is a universal 
constant that does not depend on the pj. 


We skip the proof of this technical theorem here and refer the interested reader 
to [103]. 
Remark 24.14. Note that here we do not take into account the complexity of addi- 
tions, multiplications, and comparison of two large integers. We think of any such 
operation as a single operation (as a unit of time). There are some known bounds 
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for the number of digits of the integers that are linear with respect to the coefficients 
of the matrix A. So the complexity should be multiplied by some polynomial in the 
coefficients of A. 


Theorem 24.15. Let the set of faces D satisfy the following conditions: 

(1) condition (i); 

(2) condition (ii); 

(3) positivity of all integer distances from the origin to the two-dimensional planes 
containing faces F;; 

(4) there are no integer points inside the pyramids with vertices at the origin and 
bases at F; (here integer points on the faces F; are permitted); 

(5) all dihedral angles are convex; 

(6) all stars of the vertices are regular; 

(7) all vertices of D are contained in the same invariant cone. 

Then D is a fundamental domain of some sail of the continued fraction associated 
to the matrix A. 


Let us prove that these seven stages are sufficient for the test. 


24.2.4.2 Lemma on the injectivity of the face projection 


We prove Theorem 24.15 in four lemmas. 

First let us give the necessary notation. Let the matrices B; and Bz generate 
=, (A). For any integers n and m, we write Bn,m for the matrix B7B%’. We suppose 
that our domain D satisfies Conditions 1-7 of Theorem 24.15. Let 


U= |). Bia). 


nmeZ 


Consider the two-dimensional unit sphere S? centered at the origin O. We denote 
by z the following map: 
m:R?\OS, 


where every point x € R* \ O maps to the point at the intersection of S* and the ray 
with vertex at the origin and containing x. 


Lemma 24.16. For any face of the polygonal surface U, the map 1 is welldefined 
and injective on it. 


Proof. Consider any two-dimensional face F of the surface U. By condition 3, the 
distance from the origin to the plane containing F is greater than zero. Hence this 
plane does not contain the origin. Then 7 is welldefined and injective on F. 

Let now E be some edge of U. By conditions 1 and 2, this edge is adjacent to 
some two-dimensional face and therefore is contained in some plane that does not 
pass through the origin. Thus the line containing E does not pass through the origin. 
Hence 7 is well defined and injective on E. 

The injectivity for the vertices is obvious. 
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24.2.4.3 Lemma on the finite covering of the fundamental domain 


Let x € R? \ O. Denote by N, the tetrahedral angle with vertex at the origin and base 
with vertices x, B, (x), B}B2(x), and B2(x). Notice that 


( U Pra)) \O 


nmeZ 


is one of eight invariant cones of the continued fraction associated to A that contains 
x. Note that from conditions 1, 2, and 6 it follows that all points of D are contained 
in one open invariant cone, which we denote by C. 


Lemma 24.17. Let x be some point of the open invariant cone C. Then the union of 
all faces of D is contained in a finite union of solid angles of the type Bn m(Nx). 


Proof. By Dirichlet’s unit theorem it follows that for every interior point a of the 
open invariant cone C there exists an open neighborhood satisfying the follow- 
ing condition. The neighborhood can be covered by four solid angles of the type 
Bnm(Nx) when a belongs to an edge of some B;,;(N,), by two solid angles when a 
belongs to the face of some By; (Nx), and by one solid angle in the remaining cases. 
In every case, the neighborhood can be covered by some finite union of solid angles 
of type Bn m(Nx). 

Consider a covering of D by such neighborhoods that correspond to each point of 
the closure of D. Since the closure of D is closed and bounded in R?, it is compact. 
Hence this covering contains some finite subcovering. Therefore, the union of all 
faces of D is contained in the finite union of solid angles of type Bnm(Nx)- 


Corollary 24.18. Let x be contained in the open invariant cone C. Then the solid 
angle N, contains only points from a finite number of fundamental domains of the 
type Bn m(D). 


Proof. From the last lemma it follows that D is contained in the finite union 


i 
U Brym (N,) (for some positive /). Then the solid angle N, can contain only points 
k=1 


of the fundamental domains B_n,,—-m,(D) for 1 <k <1. 


24.2.4.4 Lemma on the bijectivity of the projection 


Lemma 24.19. The map 7 bijectively takes the polygonal surface U to the set S* 0 
G. 


Proof. As was shown above, the surface U is contained in C and is taken to S* VC 
under the map 7. 

Let us introduce the following notation. By condition 2, the action of the opera- 
tors with matrices B; and Bz determines a gluing of the fundamental domain. After 
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gluing we obtain a torus, denoted by 7”. Let W be the universal covering of T?. 
The face decomposition on T° lifts to a face decomposition on W. There is a natural 
two-parameter family (with two integer parameters) of projections Pym :W — U 
that map faces to faces (since the group of shifts By; acts on U). Let us choose one 
of these projections and denote it by p (p: W > U). 

Consider the map 20 p: W — S?. This map does not have branch points at the 
images of open faces of W, since every face of W bijectively maps to some face of 
U, and the corresponding face of U injectively maps to S* AC by Lemma 24.16. 

Two faces with a common edge of the universal covering W map to some two 
faces with common edge of the surface U. Such faces of U generate a well-placed 
dihedral angle, and hence also injectively map to S* 1C. So the map 70 p does not 
have branch points at the images of open edges. 

We now consider some vertex v of W. The edges and faces of W with common 
vertex v, by condition 6, form a regular 2-star. These edges also map to some edges 
of U with common vertex. Thus there exists a sequence of points that tends to 70 
p(v) (contained in $”) such that the preimage of every point of the sequence has 
exactly one preimage in the 2-star of v. Hence 70 p does not have branch points in 
the branch containing the star at 20 p(v). Therefore, 20 p does not have any branch 
points at the vertices. So the map 20 p : W —> S*C does not have branch points. 

Consider an arbitrary point x € S?C and the solid angle N, corresponding to 
it. Let x; and x2 be two points of S7.N,. We will now show that the preimages 
(10 p)~!(x,) and (x0 p)~!(x2) contain the same number of points. Let us join 
the points x; and x2 by some curve inside S* 1 N,. By Corollary 24.18, we know 
that the preimage of this curve is contained in a finite number of faces of W. Since 
there are no branch points in any face (and their number is finite) and there are no 
boundary faces of W, the number of preimages for 790 p is some (finite) discrete and 
continuous function on this curve. Therefore, the number of preimages for 7 0 p of 
any two points of S? MN, is the same. Hence the number of preimages for 70 p of 
any two points of S* C is the same. 

From this we conclude that the projection 70 p of the universal covering W 
(homeomorphic to an open disk) to S* NC (i.e., homeomorphic to an open disk) is 
a nonramified covering with finitely many branches. Since the covering (of an open 
disk by an open disk) is piecewise connected, the number of branches equals one. 
Since by definition, p : W — U is surjective and by all of the above it is injective, 
the maps p: W + U and 7: U > S*NC are bijective. 


24.2.4.5 Lemma on convexity 


Since every ball centered at the origin contains only a finite number of vertices of 
U (and they do not form a sequence tending to the boundary of the invariant cone 
C), the polyhedral surface U divides the space R? into two connected components. 
Denote by H the connected component of the complement to U that does not contain 
the origin. 
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Lemma 24.20. The set H is convex. 


Proof. Suppose that some plane passing through the origin intersects the polygonal 
surface U and does not contain any vertex of U. By Lemma 24.19 such a plane 
intersects U at some piecewise-connected broken line with an infinite number of 
edges. The complement of the plane of this broken line consists of two connected 
components. By assumption all vertices of this broken line are contained in open 
edges of U. By condition 5 all dihedral angles of U are well-placed. Thus the angle 
at every vertex of intersection of H with our plane is less than a straight angle. Hence 
by the previous lemma the intersection is convex. 

Consider the set of all planes that pass through the origin, intersect U, and do not 
contain vertices of U. This set is dense in the set of all planes passing through the 
origin and intersecting U. Therefore, by continuity it follows that the intersection of 
H with any plane passing through the origin (and intersecting U) is convex. 

Now we prove that the set H is convex. Let x; and x2 be some points of H. 
Consider the plane that spans x), x2, and the origin. This plane intersects U, since x 
is in H and the origin is not in H. By the above, the intersection of H with this plane 
is convex. Hence the segment with endpoints x; and x2 is contained in H. Thus H is 
convex (by the definition of convexity). 


24.2.4.6 Conclusion of the proof of Theorem 24.15: the main part 


So the constructed polygonal surface U has the following properties: 
— by Lemma 24.20, U bounds the convex set H; 

— by construction, all vertices of U are integer points; 

— by Condition 4, the set C \ H does not contain integer points. 

Therefore, the polygonal surface U is the boundary of the convex hull of all 
integer points inside C. Thus by definition, U is one of the sails of the continued 
fraction associated to the matrix A. 

Let us formulate one important conjecture here. 


Conjecture 29. Conditions 1-6 imply condition 7. 


24.2.5 On the verification of a conjecture for the multidimensional 
case 


Here we briefly outline an idea of how to test a conjecture on a fundamental domain 
of a multidimensional continued fraction. 


Conjecture for the multidimensional case. Suppose that we have a conjecture on 
some fundamental domain D, and also some basis B,,...,B, of the group 24 (A). 
Also let the fundamental domain and the basis have the following properties: 

(i) the closure of the fundamental domain is homeomorphic to the disk; 
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(ii) the operators with matrices B),...,B, define the gluing of this disk to the n- 
dimensional torus. 


How does one test the conjecture for fundamental domains of multidimensional 
continued fractions ? The verification of conditions (i) and (ii) is straightforward and 
is omitted. If these conditions hold, we check whether all the n-dimensional faces 
of the fundamental domain are faces of the sail. This can be done in the following 
way. 

Suppose that the integer distances from the origin to the planes of faces F; are 
equal to dj (i= 1,...,p, where p is the number of all n-dimensional faces). Our 
conjecture is true if and only if for alli =1,...,p the following conditions hold: 
(a) For every integer d < d; consider the plane parallel to the face F; with integer 
distances to the origin equal to d. The intersection of our invariant cone with this 
plane does not contain any integer point. 

(b) For d = dj the convex hull of all integer points in the intersection coincides with 
face F;. 

The verification of conditions (a) and (b) is quite complicated from the algorithmic 
point of view. 

We conclude this section with the important inverse question of constructing pe- 
riodic continued fractions. 


Problem 30. (V.I. Arnold.) Does there exist an algorithm to decide whether a given 
type of fundamental domain is realizable by a periodic continued fraction? 


The answer to this question is unknown even for the two-dimensional periodic 
continued fractions. 
24.3 An example of the calculation of a fundamental domain 


Let us construct fundamental domains for two-dimensional continued fractions of 
Example 22.15. Recall that 


01 0 
Amn=|00 1 
1 —m—n 


Theorem 24.21. Let m= b—a—1,n=(a+2)(b+1) (where a,b > 0). Consider 
the sail for the matrix Am» containing the point (0,0, 1). Let 


A= (1,0,a+2), B= (0,0,1), 
C=(b-—a-—1,1,0), D=((b+1)?,b+1,1). 


Then the following set of faces forms one of the fundamental domains: 
(1) the vertex A; 


24.3 An example of the calculation of a fundamental domain 345 


D C 


A B 
a,b>0 


Fig. 24.1 The closure of the fundamental domain of a sail of a fraction associated to the matrix 
Ap_—a—1,(a+2)(b+1) (here b = 6, and a is arbitrary). 


(2) the edges AB, AD, and BD; 
(3) the triangular faces ABD and BDC. 


The closure of the fundamental domain is homeomorphic to the square shown in 
Fig. 24.1 (for the case of an arbitrary a, and b = 6). 


Proof. Steps 1 and 2. We omit the first and the second steps (these steps are classical; 
see [41]) and here write down the result. The following two matrices generate the 
group ©, (A): 

Xab=Anw Yab=Amn(Amn — (b+ 1)Id), 


m,n? mn\4*mn 
where Id is the identity element in the group SL(3,Z). 
Step 3. We prove that (0,0,1) is a vertex of the sail. Consider the plane passing 
through A, B, and D: 
(-l—a)x+ (ab+at+b+l)y+z=1. 
When the equations (in variables x, y, and z) 


(-l—a)x+ (abt+at+b+l)y+z=a 


do not have any integer solution for 0 < a < 1, the integer distance from ABD to 
the origin is equal to one. There are exactly three integer points (A, B, and D) in 
the intersection of the plane and the invariant cone. We leave the proof of this as an 
exercise for the reader. 


Step 4. The conjecture of the fundamental domain was produced in the statement of 
this theorem. 


Step 5. It remains to test the conjectured fundamental domain. For the test we need 
some extra points: 
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E=X,,(B) 
= (1,-ab—a— 2b —2,a7b? + 2a7b 4 4ab? +a? + 8ab+4b? +5a+7b+5); 
F =Y,)(B) = (—a—2,1,0); 
H =X,)(F) = (0,—b—1,ab? + 2ab + 2b" +a + 4b +3). 


1. (Test of condition (i)). It can be shown in the usual way that the faces have 
common edge BD, and the edges intersect only at vertices. This implies that all 
adjacencies are correct, and that only one or two faces are adjacent to each edge. 
The closure of the boundary is a closed broken line ABCDA, homeomorphic to the 
circle. 


2. (Test of condition (ii)). Direct calculations show that the operator with matrix 
Xz,» takes the segment AB to DC (the point A goes to the point D and B to C). The 
operator with matrix Y, takes AD to BC (A goes to B and D to C). Obviously no 
other points glue together. The Euler characteristic of the obtained surface equals 
2—3-+1,1e., zero, and the surface is orientable. 


3. (Calculation of all integer distances from the origin to the two-dimensional 
planes containing faces.) The integer distance to the plane of ABD equals 


2 
; 1 0b7+2b+1 rae 
Thal aeeiy: Bs 


The integer distance from the origin to the plane of BDC equals 


0 b*+2b+1b-—a-1 
= SA “baad 1 
1 1 0 


_ ab+2b+a+2 _ 


2, 
PLA ee 


4. (Test on nonexistence of integer points inside the pyramids with vertices at 
the origin and bases at the faces.) Since the integer distance from the origin to 
the plane containing ABD equals one, the pyramid corresponding to ABD does not 
contain integer points different from O and the points of the face ABD. 

The face BDC is integer-linear congruent to the face with vertices (1,a+1,—a— 
2), (b+2,a+1,—a—2), (1,a+2,—a—2) of the the list “T-W” (see Fig. 19.2). The 
corresponding transformation taking BCD to the face of the list “T-W” is as follows: 


b+1b—a—-1b—-a 
1 1 1 


By Theorem 19.5 the pyramid corresponding to BDC does not contain integer points 
different from O and the points of the face BDC. 

5. (Test on convexity of dihedral angles.) Let us first consider the edge BD. This 
edge is adjacent to the faces ABD and BDC. The face ABD is contained in the plane 
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fasp(x,y,Z) = 0, and the face BDC is contained in the plane fgpc(x,y,z) = 0, where 


fapp(x,y,Z) = (-l—a)x+ (abt+atb+l)y+z-1; 
feoc(x,y,2) =x+(b+ly—(a+2)z+(a+2). 


To test that the dihedral angle corresponding to the edge BD is well-placed it is 
sufficient to verify the following: the point C and the origin O lie in different half- 
spaces with respect to the plane spanned by the points A, B, and D and the points A 
and O lie in different half-spaces with respect to the plane spanned by the points C, 
B, and D. So we need to solve the following system: 


een - fasp(O) <0, 
fepc(A)- fepc(O) < 0. 


This system is equivalent to 


(a7 +3a+2)-(—1) <0, 
{, a* —3a—1)-(a+2) <0. 


Since a > 0, the inequalities hold. Thus the dihedral angle associated with the 
edge BD is well-placed. Since the cases of dihedral angles associated to the edges 
AB (and the faces ADB and AEB) and BC (and the faces BDC and CBF) can be 
verified in the same way, we omit their descriptions. This concludes the test of Con- 
dition 5. 

6. (Verification that all 2-stars of the vertices are regular.) There is only one 
vertex in the torus decomposition. Any lift of this point to the universal covering W 
is adjacent to six edges and six faces. Consider a vertex of the universal covering 
that maps to the point B. The corresponding 2-star maps to six edges BC, BD, BA, 
BE, BH, and BF and to six faces BCD, BDA, BAE, BEH, BHF, and BFC, where 


H =X_,(F) = (0,—b—1,ab? +2ab + 2b +a +4b +3). 


We will check that for every sufficiently small positive €, a ray Je with vertex at the 
origin and passing through the point P, = (€,0, 1) intersects exactly one of the faces 
of 2-stars. 

First we will check that for every sufficiently small positive €, the ray /, intersects 
the triangle BCF, or equivalently, that the ray J, is contained in the trihedral angle 
with vertex at the origin O and base in the triangle BCF. The two-dimensional face 
of the trihedral angle containing B, C, and O can be defined by f4g0 = 0, and the 
two-dimensional face of the trihedral angle containing B, F, and O can be defined 
by fero = 0; the two-dimensional face of the trihedral angle containing C, F, and 
O can be defined by {cro = 0, where 


feco(*,y,z) =x+(a+1—b)y; 
faro(x,y,z) = x+ (at2)y; 
fcro(%,y,2) = 2. 
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For every sufficiently small positive €, the ray /¢ is contained in the dihedral angle 
defined above if the following conditions hold: the points P, and F are in the same 
closed half-space with respect to the plane fgco = 0, the points Pe and C are in the 
same closed half-space with respect to the plane fgrq = 0, and the points P, and 
B are in the same closed half-space with respect to the plane fcrg = 0. Since the 
points P, and B are close to each other for sufficiently small €, they are in the same 
closed half-space with respect to the plane fcro = 0. We now check the remaining 
two conditions: 


aaa - feco(F) = 9, us { (—b—l)e > 0, 
fero(Pe): faro(C) = 0, (b+1)e > 0. 


Since b,€ > 0, the first inequality does not hold. Thus for every sufficiently small 
positive €, the ray /,¢ does not intersect the triangle BCF. 

The cases of the triangles BCD, BDA, BAE, BEH, and BHF are similar to those 
described above and are omitted here. 

The ray /, (for any sufficiently small positive €) intersects the bijective image of 
the 2-star of the vertex at exactly one point contained in the edge AB. Therefore, all 
2-stars associated to the vertices are regular. 

7. (Test that all the vertices of D are in the same invariant cone.) The test of the 
seventh stage for this theorem is trivial, since D contains exactly one vertex. 


24.4 Exercises 


Exercise 24.1. Construct fundamental domains of the matrices in Examples 22.9— 
22.12 and prove that the corresponding triangulations are correct. 


Check for 
updates | 


Chapter 25 
Gauss Reduction in Higher Dimensions 


In this chapter we continue to study integer conjugacy classes of integer matrices in 
general dimension. Recall that matrices A and B in SL(n, Z) are integer conjugate if 
there exists a matrix C in GL(n,Z) such that 


B=CAC"!. 


The natural problem here is as follows: describe the set of integer conjugacy classes 
in SL(n,Z). In this section we give an answer to this question for matrices whose 
characteristic polynomials are irreducible over the field of rational numbers. 


Gauss‘s reduction theory gives a complete geometric description of conjugacy 
classes for the case n = 2, as we have already discussed in Chapter 11. In the multi- 
dimensional case the situation is more complicated. It is relatively simple to check 
whether two given matrices are integer conjugate (see, for instance, [6] and [80]), 
but to distinguish conjugacy classes is a much harder task. In this chapter we de- 
scribe a generalization of Gauss’s reduction theory to the multidimensional case 
(see also [107]). We study questions related to the three-dimensional case in more 
detail. 


There exists an alternative algebraic approach to the study of SL(n, Z) conjugacy 
classes, which we do not touch in this book. There one starts with GL(n,Q) con- 
jugacy classes and splits them into GL(n,Z) conjugacy classes. This reduces the 
problem to certain problems related to orders of algebraic fields that are defined by 
the roots of the characteristic polynomial of the corresponding matrices. 


25.1 Organization of this chapter 


In this chapter we investigate the following five aspects. 
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I. Generalized reduced matrices. In the multidimensional Gauss’s reduction the- 
ory, Hessenberg matrices play the role of reduced matrices. A Hessenberg matrix 
is a matrix that vanishes below the superdiagonal (see [208]); they were introduced 
by K. Hessenberg in [85]. Hessenberg matrices were essentially used in the QR- 
algorithm for the eigenvalue problem (see [59], [213], and [167]). Further, in [107] 
they were considered in the framework of the multidimensional Gauss reduction 
theory as a multidimensional analogue of reduced matrices. 

In Section 25.2 we introduce a natural notion of Hessenberg complexity for Hes- 
senberg matrices. Further we show that each integer conjugacy class of SL(n,Z) 
has only finitely many distinct Hessenberg matrices with minimal complexity (The- 
orem 25.8). 


II. Complete invariant of integer conjugacy classes. In Section 25.3 we intro- 
duce a complete geometric invariant of a Dirichlet group. It is a periodic multi- 
dimensional continued fractions in the sense of Klein—Voronoi (Theorem 25.23). 
Klein—Voronoi continued fractions naturally generalize Klein continued fractions. 
Further, we deduce the complete invariants of integer conjugacy classes of GL(n, Z) 
matrices. We show that the conjugacy classes are represented by periodic shifts of 
Klein—Voronoi periodic continued fractions (Theorem 25.24). 


Il. Calculation of reduced matrices. In Section 25.4 we introduce a technique to 
construct reduced matrices in any integer conjugacy class that has minimal Hessen- 
berg complexity in this class. 


IV. Solution of Diophantine equations related to certain decomposable forms. 
In Section 25.5 we study the integer points at the level sets of the decomposable 
forms (we have already discussed this question in Chapter 11 in connection with the 
Markov spectrum). As a technical detail we introduce a notion of multidimensional 
w-continued fractions similar to its one-dimensional version of Chapter 14. 


V. Integer conjugacy classes of SL(3,Z). We examine the structure of integer con- 
jugacy classes of SL(3,Z) via Klein—Voronoi continued fractions. The main re- 
sults concern only the operators with a pair of complex conjugate eigenvectors. It 
turns out that in this case, Hessenberg matrices distinguish corresponding conjugacy 
classes asymptotically (Theorem 25.48). The analogous statement does not hold for 
the case of operators with three real eigenvalues. In general it is much less known 
for the case of three real eigenvalues. 


25.2 Hessenberg matrices and conjugacy classes 


In Section 25.2.1 we begin with necessary definitions and notation related to Hes- 
senberg matrices. Further, in Section 25.2.2 we discuss algorithmic aspects of con- 
struction of perfect Hessenberg matrices integer congruent to a given one. In Sec- 
tion 25.2.3 we show that every integer conjugacy class with irreducible characteris- 
tic polynomial has a finite nonzero number of ¢-reduced matrices. Further in Sec- 
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tion 25.2.4 we study the set of perfect Hessenberg matrices. We think of this set as 
a “book” whose “pages” are enumerated by Hessenberg types. Since the matrices 
from the same page are distinguished by their polynomials, any two matrices on one 
page are not integer congruent. We conclude this section with a brief discussion of 
c-reduced 2-dimensional matrices of the same Hessenberg type. 


25.2.1 Notions and definitions 


In this subsection we briefly introduce Hessenberg matrices, which we consider as 
generalized reduced matrices in the multidimensional case. 


25.2.1.1 Perfect Hessenberg matrices 


A matrix M of the form 


411412°°* Gin-2 An-1 Ain 

421 422°** G2n-2  42n-1  G2n 
0 a32°** Gn-2 @n-1 43n 
0 0 +++ ay 1,n—2 Gn—1,n—-1 Gn-1,n 
0 O 0) Ann—-1 Ann 


is called an (upper) Hessenberg matrix. We say that the matrix M is of Hessenberg 
type 
(a1,1,42,1 |a1,2,42,2,43,2| ia [41 n—1)42,n-1 or oe Origa): 


Definition 25.1. A Hessenberg matrix in SL(n, Z) is said to be perfect if for every 
pair of integers (i,j) satisfying 1 <i < j+1 <n the following inequalities hold: 
0 <ajj < aj41,j. 


In other words, all elements in the first n—1 columns of a perfect Hessenberg 
matrix are nonnegative integers, and in addition, the element aj+ ,; is maximal in 
each of these columns. 


25.2.1.2 Reduced Hessenberg matrices 


We study only the simplest general case of SL(n,Z) matrices whose characteris- 
tic polynomials are irreducible over Q. Every such matrix is integer conjugate to a 
perfect Hessenberg matrix, and each conjugacy class of SL(n, Z) contains infinitely 
many perfect Hessenberg matrices. To reduce the number of such matrices, we in- 
troduce a natural notion of complexity. 
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Definition 25.2. Consider a Hessenberg matrix M = (a;,;). The integer 


is called the Hessenberg complexity of M. Denote it by ¢(M). 


The Hessenberg complexity is in fact equivalent to the volume of the paral- 
lelepiped spanned by v = (1,0,...,0), M(v), M?(v),...,M~!(v), as we will discuss 
in Section 25.4.1. 

An integer Hessenberg matrix has unit Hessenberg complexity if and only if 


a2. =" =4nn-1 = 1. 


In the literature such matrices are known as Frobenius matrices (or companion ma- 
trices). The elements of the last column of any Frobenius matrix are exactly the 
coefficients of the characteristic polynomial multiplied alternately by +1. 


Example 25.3. Consider the matrix 


12 3 
23 6 
05-1 


This matrix is a perfect Hessenberg matrix of type (1,2|2,3,5). Its Hessenberg com- 
plexity is 27-5 = 20. 


Definition 25.4. A perfect Hessenberg matrix M is said to be ¢-nonreduced if there 
exists an integer matrix M’ of smaller Hessenberg complexity integer congruent to 
M. Otherwise, we say that M is ¢-reduced. 


Remark. Let us say a few words about the relation between reduced matrices intro- 
duced in Chapter 11 and ¢-reduced matrices in SL(2,Z). On the one hand, every 
reduced hyperbolic (i.e., real spectrum) matrix is a perfect Hessenberg matrix. On 
the other hand, every ¢-reduced matrix is also reduced, although there are certain 
reduced matrices that are not ¢-reduced. 


Later, in Theorem 25.8, we show that the number of ¢-reduced matrices is finite 
for every integer conjugacy class. Some classes contain more than one ¢-reduced 
perfect Hessenberg matrix. 


Example 25.5. The ¢-reduced Hessenberg matrices (with Hessenberg complexity 
equivalent to 3) 


012 023 
M,;={|100 and Mo=|{111 
035 034 


are integer conjugate. 
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25.2.2 Construction of perfect Hessenberg matrices conjugate to a 
given one 


In this subsection we show how to construct perfect Hessenberg matrices that are 
integer congruent to a given one. The main ingredients of the algorithm are in the 
proof of the following proposition. 


Proposition 25.6. Consider an SL(n,Z) matrix M whose characteristic polynomial 
is irreducible over Q. Then for every vector v of integer length | there exists a unique 
matrix C satisfying the following conditions: 

—C(1,0,...,0) =v; 

— the matrix CMC™' is perfect Hessenberg (we denote this matrix by (M|v)). 


In other words, for a given integer operator A, every integer vector of unit integer 
length is uniquely extended to the basis of the integer lattice in which the operator 
A has a perfect Hessenberg matrix. 


Proof. Existence. Consider an SL(n,Z) matrix M with irreducible characteristic 
polynomial. Let A denotes the linear operator with matrix M. Consider an integer 
vector v of unit integer length. For i = 1,...,1— 1 we define 


V; = Span (v,A(v),A7(v),...,4'"(v)). 


Since the characteristic polynomial of M is irreducible, the set of all spaces V; forms 
a complete flag in R”. All the vectors A/(v) are integer vectors; hence every space 
V; contains a full-rank integer sublattice. 


Let us inductively construct an integer basis {é;} of a vector space R” such that: 
— for every i the vectors é),...,é; generate the sublattice Z” N V;; 
— the matrix of A is perfect Hessenberg in the basis {é;}. 


Base of induction. Let @, = v. Since 1€(é,) = 1, the vector é; generates Z"NV\. 


Step of induction. Suppose that we have already constructed é1,...,é, forming a 
basis of Z” 1 V;. Let us now calculate é,,,. Choose an integer vector g,+1 of the 
space V1; a unit integer distance to the space V;. Since é),...,é, generate Z” NV;, 
the vectors @1,...,€, 2x41 generate the sublattice Z” 1V;11. Since A(E) € Vi41, we 


have 
k 


A(Ex) = Yo gik@i+ Oe v1 kk 
i=l 
For i= 1,...,k, we define b;, and a;, as integer quotients and remainders of the 
following equations: 


ik = Dix . |ax+1,4\ +4ik, where O< Aik < A+ 1k: 


Then we have 
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k k 
A(€x) = lac+14l (sense + hus +) aj pe; 
i=1 i=l 


(where sign is the sing function over the real numbers). Finally, we get 


k 
Geri = sign(acs14) Seri +) dinéi- 
i=l 
The characteristic polynomial of A is irreducible over Q, and hence the integer 
spaces V; are not invariant subspaces of A. Therefore, the set {@),...,@41} is a 
basis of Z” 1 V;41. The induction step is complete. 


Denote by M the matrix of A in the basis {é;}. By construction, the matrix M is 
a perfect Hessenberg matrix, and its Hessenberg type is 


(ai, |a2,1||a1,2,42,2, lazal| 9 1Q1 1506+ An—1n—1) [Ann iI). 

Both M and M are matrices of A in two different integer bases, which means they 
are integer conjugate. The integers aj;;1; are nonzero (i = 1,...,n—1), since the 
characteristic polynomial of A is irreducible. Therefore, ¢(M) > 0. Denote by C the 
transition matrix to the basis {é;}. Then C(é;) = v and M =CMC7! (where M is a 


perfect Hessenberg matrix). 


Uniqueness. Let C,; and C) satisfy the conditions of the theorem. Suppose that C; 
and C> are the transition matrices to the bases {é;} and {é;} respectively. Then 


é) =Ci(e1) =v =Cr(e1) = 41. 


For i > 1 the equality é; = é; follows from the uniqueness of the choice of the coeffi- 
cients in the proof of existence. Hence the bases {é;} and {é;} coincide. Therefore, 
Ci =O. 


Now let us briefly summarize the algorithm. 
Algorithm to construct perfect Hessenberg matrices. 


Input Data: (M,v). Here M is a matrix of a lattice-preserving operator A whose 
characteristic polynomial is irreducible over g, and v is an integer vector of unit 
integer length. 


Step 1. Put é; =v. 


Inductive Step k. We have already constructed é),...,é,. Choose gx41 € Vis sat- 
isfying ld(gx41,Vk) = 1 and find integers gj, for i= 1,...,k and ay+1,¢ from the 


decomposition 
k 


A(é&) = y Gi ki + AKL ASK HL 
j=l 


Find b;; and aj, (i= 1,...,k) as integer quotients and reminders of the equations 
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ik = bik: Oks kl +Gix- 


Put 
k 


Geri = sign(arsi4)geri +), dix. 
ay 


Output Data: the perfect Hessenberg matrix CMC™!, where C is the transition 
matrix to {é}. 

Note that the basis {é;} is constructed in n steps. 

Later, in the proof of Theorem 25.31, we use the following corollary. 
Corollary 25.7. Consider an SL(n,Z) operator A with matrix M and let B € &(A). 
Then for an arbitrary v we have (M|v) = (M|B(v)). 


Proof. Each step of the algorithm produces the same data for v and B(v), due to the 
fact that A and B commute. Therefore, (M|v) = (M|B(v)). 


25.2.3 Existence and finiteness of ¢-reduced Hessenberg matrices 


It turns out that ¢-reduced Hessenberg matrices are good for the classification of 
conjugacy classes in SL(n,Z). Although they do not form complete invariants of 
conjugacy classes, they are present in each class and their number is finite. We give 
an explicit construction of all ¢-reduced Hessenberg matrices conjugate to a given 
one in Section 25.4.3 via Klein—Voronoi continued fractions. 


Theorem 25.8. Let M be an SL(n,Z) matrix. Then the number of ¢-reduced Hes- 
senberg matrices integer conjugate to M is finite and greater than zero. 


The proof of this theorem is based on the following proposition. 


Proposition 25.9. Every Hessenberg matrix with positive Hessenberg complexity is 
identified by its Hessenberg type and characteristic polynomial. 


Proof. Let M = (a;,;) be a Hessenberg matrix with positive Hessenberg complexity. 
The first n — 1 columns of M are entirely defined by the Hessenberg type of M. The 
last column is uniquely defined from the characteristic polynomial of M, 


Pie a aE ee ore arte er 


in the following way. For every k the coefficient c, is a polynomial in variables 
qj,j that does not depend on a y,...,d%,. This polynomial has a unique monomial 


containing d+) ,, namely 
n—1 
I] Aj+ij | Ck+1n- 


jak+1 
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Since ¢(M) # 0, the product in the parentheses is nonzero. Hence ag; is uniquely 
defined by c, and the elements a; ; of the first n — 1 columns. Therefore, all elements 
of M are defined by the Hessenberg type of M and the characteristic polynomial of 
M. 


Proof of Theorem 25.8. Existence. By Proposition 25.6 there exist perfect Hessen- 
berg matrices integer conjugate to M. The set of values of Hessenberg complexity 
is discrete and bounded from below; hence there exists a perfect Hessenberg matrix 
M integer conjugate to M and with minimal possible Hessenberg complexity. By 
definition the matrix M is a ¢-reduced. 


Finiteness. Let c be the Hessenberg complexity of M. By definition the Hessenberg 
complexity of all the other ¢-reduced Hessenberg matrices integer conjugate to M 
equals c. The number of Hessenberg types whose Hessenberg complexity equals 
c is finite. Notice that the integer conjugate matrices have the same characteristic 
polynomials. From Proposition 25.9 there exists at most one integer Hessenberg 
matrix with a given Hessenberg type and a given polynomial. Therefore, the number 
of ¢-reduced Hessenberg matrices integer conjugate to M is finite. 


25.2.4 Families of Hessenberg matrices with given Hessenberg type 


In this subsection we discuss the structure of Hessenberg matrices with given Hes- 
senberg type. We begin with a particular example. 


Example 25.10. Let us examine the Hessenberg type (0,1|1,0,2). The set of all 
matrices of this Hessenberg type is a two-parameter family with parameters m and 
n: 


011 000 001 
H((0,1|1,0,2)) = 100}]+m{001]+n/000] ImnEeZ 
021 000 002 
Define 
vishets O1ln+l1 
Ao 11.0.2) (mn)=|{10 m 
02 2n+1 


The discriminant of H, ul 05s (m,n) equals 


44 — 44n* — 56mn —32n? +32m? + 16m2n? + 16mn* + 16m2n—56n—8m+52m?. 


The set of matrices with negative discriminant for the given family coincides with 
the union of the sets of integer solutions of the following quadratic inequalities: 


2m < —n*—n—2 and 2n <m+m. 
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m+n=-1 x1) =0 


m 
——r 
Gees x(1) =0 


Fig. 25.1 Matrices of Hessenberg type (0, 1|1,0,2). 


In Fig. 25.1 we represent a matrix mae, (m,n) by the square in the inter- 
section of the mth column and the nth row. Matrices with reducible characteris- 
tic polynomials correspond to black squares. Light gray squares represent matrices 
with three real eigenvalues. Matrices shown as white squares have a pair of complex 
conjugate eigenvalues. 


It turns out that the general case is similar. Consider a Hessenberg type 


Q = (41,1,42,1|41,2,42,2,43,2|°°* |@1yn—15-++4n—1n—15 Ann) 


Denote by H(Q) the set of all Hessenberg matrices in SL(n, Z) of Hessenberg type 
Q. 
For k = 1,...,n2—1 we put 


v4 (Q) = (Gg Ayers Ge RELI Oyi040) 


and denote by M;(Q) the matrix whose first n—1 columns are equal to zero and the 
last one equals to v, (2). Denote also by o(Q) the (n—1)-dimensional simplex with 
vertices 

O (the origin), vj, ..-,  Vn—1- 


Theorem 25.11. (i) The set H(Q) is not empty if and only if 1V(o(Q)) = 1. 


(ii) Let Mo € H(Q). Then H(Q) is an integer affine (n—1)-dimensional sublattice 
in the lattice of all integer (n x n) matrices, i.e., 


358 25 Gauss Reduction in Higher Dimensions 


n—1 
H(Q)= {4 y ciMi(Q)fe1,..-56n-1 € x) ; 
i=1 


The proof of Theorem 25.11 is based on the following lemma. 


Lemma 25.12. Consider an operator A with integer Hessenberg matrix M of type 
Q. Let v be the vector standing in the last column of M. Then M € GL(n,Z) if and 
only if the following conditions hold: 

—IV(o(Q)) =1; 

— ld(v, Span(o(Q))) = 1. 


Proof. Necessary condition. Consider an operator A with Hessenberg GL(n,Z) 
matrix M in some integer basis {g;}. Denote by a the (n—1)-dimensional sim- 
plex with vertices 

O} 815. vy Bn-1- 


Since M € GL(n,Z), the operator A preserves integer volumes and integer dis- 
tances. Since 1V (St) = I, we have the first condition: 


IV(o(Q)) =1V(A(Sz-')) =1V(Sp-1) = 1. 
Notice that 
A(Span(S7~')) = Span(o(Q)) and A(n) =v. 
Therefore, we get the second condition: 
ld(v, Span(o(Q))) = 1d(gn,A(Span(Sz'))) = Id(gn,Span(g1,-.-,8n-1)) = 1. 


Sufficient condition. Suppose that both conditions of the lemma hold. Then the 
operator A preserves the integer lattice (generated by g1,...,g,). Therefore, M € 
SL(n,Z). 


Proof of Theorem 25.11. (i) Suppose that 1V(o(Q)) = 1. Choose an integer vector 
v at unit integer distance to the plane Span(o(Q)). By Lemma 25.12 both matrices 
defined by Q and vectors +v are in GL(n,Z). One of them is in SL(n,Z). Con- 
versely, if H(Q) contains an SL(n,Z) matrix, then by Lemma 25.12 the integer 
volume of o(Q) equals 1. 


Statement (ii) is straightforward, since the determinant of a matrix is an additive 
function with respect to the operation of vector addition in the last column. 


25.2.5 ¢-reduced matrices in the 2-dimensional case 


Let us say a few words about ¢-reduced 2-dimensional matrices in families H(Q). 
We start with the following example. 
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Example 25.13. Consider the family of matrices H ((3,4)): 
6 3m+ 5) 
44m+3)~ 
Their Hessenberg complexity equals 4. Almost all matrices of this family have 


irreducible, over Q, characteristic polynomials with two real roots (except for 
m= —1,—2). Form =0,1,2,3,... the LLS-periods of the matrices are 


(2,2); (1,2,1,1), (1,2,1,2), (1,2,1,3), eee | (1,2,1,m), 


The matrices are ¢-reduced for m > 2. For m = —3,—4,—5,... the corresponding 
LLS-periods are 


(4,1), (4,2), (4,3), (4,4), ..., (4,-2-m), 


Starting from m < —6 the matrices are ¢-reduced. 


The example of Hessenberg type (3,4) shows that almost all matrices of this 
Hessenberg type are ¢-reduced. That is actually the case for all Hessenberg types in 
SL(2,Z). 


Theorem 25.14. Almost all matrices of a given Hessenberg type in SL(2,Z) are 
c-reduced. 


This follows from Theorem 25.31 below and direct calculation of complexities 
for all vertices of the period; we skip the proof here. 


In Theorem 25.48 we prove similar behavior for SL(3,Z) matrices having two 
complex conjugate eigenvalues (see also Conjecture 31). 


25.3 Complete geometric invariant of conjugacy classes 


In this section we introduce a geometric complete invariant of integer conjugacy 
classes: multidimensional continued fractions in the sense of Klein—Voronoi. We 
start with necessary definitions in Section 25.3.1. In particular, we give a gen- 
eral definition of Klein—Voronoi continued fractions in all dimensions in Sec- 
tion 25.3.1.1 and discuss their algebraic periodicity. Further, in Section 25.3.2 we 
show that the Klein—Voronoi continued fraction for an operator A is a geomet- 
ric complete invariant of the corresponding Dirichlet group =(A). Finally, in Sec- 
tion 25.3.3 we discuss that periodic shifts distinguish integer conjugacy classes of 
SL(n,Z) matrices. 
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25.3.1 Continued fractions in the sense of Klein—Voronoi 


Approximately at the time of the work by F. Klein on continued fractions related 
to real matrices, G. Voronoi in his dissertation [220] introduced a geometric algo- 
rithmic definition that covers cases of matrices having complex conjugate eigen- 
values. In [32] and [33] J.A. Buchmann generalized Voronoi’s algorithm, making 
it more convenient for computation of fundamental units in orders. We use ideas 
of J.A. Buchmann to define the multidimensional continued fraction in the sense of 
Klein—Voronoi for all cases. Note that if all eigenvalues of an operator are real then 
the Klein—Voronoi multidimensional continued fraction is a continued fraction in 
the sense of Klein. 


25.3.1.1 General definitions 


Consider an operator A in GL(n,R) with distinct eigenvalues. Suppose that it has 
k real eigenvalues r),...,7% and 2/ complex conjugate eigenvalues c1,€1,...,¢1,€, 
where k + 2] =n. Denote by Lp(A) the space spanned by the real eigenvectors. 

Denote by T!(A) the set of all real operators commuting with A such that their 
real eigenvalues are all units and the absolute values for all complex eigenvalues 
equal one. In other words: 


Ts = {B € GL(n,R) | AB =BA, spec(B) C S!,Blr..(4) = Id|,5 ay}, 


where spec(B) is the spectrum of B and S! is the complex unit circle. In fact, T'(A) 
is an abelian group with the operation of matrix multiplication. 

For a vector v in R”, we denote by T,(v) the orbit of v with respect to the action of 
the group of operators T'(A). If v is in general position with respect to the operator 
A (i.e., it does not lie in invariant planes of A), then T,(v) is homeomorphic to the 
J-dimensional torus. For a vector of an invariant plane of A the orbit 7,(v) is also 
homeomorphic to a torus of positive dimension not greater than /, or to a point. 


Example 25.15. Suppose that A is a real spectrum matrix, i.e., / = 0. Since all its 
eigenvectors are real, T°(A) consists only of the unit operator and T4(v) = {v}. 


Example 25.16. Now consider the case of a pair complex conjugate eigenvalues, 
ie., 1 = 1. The group 7!(A) corresponds to elliptic rotations in the invariant plane 
of A corresponding to complex eigenvalues. Such rotations are parameterized by 
the angle of rotation. A general orbit of T4(v) is an ellipse around the (n—2)- 
dimensional invariant subspace corresponding to real eigenvalues. Every orbit in 
the invariant subspace spanned by real eigenvalues consists of one point. 


Let g; be a real eigenvector with eigenvalue r; for i= 1,...,k, and let gy42j-1 
and gx,2; be vectors corresponding to the real and imaginary parts of some com- 
plex eigenvector with eigenvalue c; for j = 1,...,/. Consider the coordinate system 
corresponding to the basis {g;}: 
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OX,X2...X4Y1Z, VoL... YZ. 


Denote by z the (k+/)-dimensional plane OX,X2...X;¥Y,¥Y2...¥. Let 17 be the 
cone in the plane 7 defined by the equations y; > 0 fori = 1,...,/. For every v the 
orbit T4(v) intersects the cone 7, in a unique point. 


Definition 25.17. A point p in the cone 7, is said to be 2-integer if the orbit T,(p) 
contains at least one integer point. 


Consider all (real) hyperplanes invariant under the action of the operator A. There 
are exactly k such hyperplanes. In the above coordinates, the ith of them is defined 
by the equation x; = 0. The complement to the union of all invariant hyperplanes in 
the cone 77, consists of 2 arcwise connected components C\(A),...,Cx(A). 


Definition 25.18. The convex hull of all z-integer points except the origin contained 
in an arcwise connected component C;(A) is called a factor-sail of C;(A). The set 
of all factor-sails is said to be the factor-continued fraction for the operator A. We 
denote it by $;(A). 


Definition 25.19. The union of all orbits T,4(*) in R” represented by the points in 
the factor-sail of C;(A) is called the sail of T,(C;), which we denote by S;(A). The 
union of all sails is said to be the continued fraction for the operator A in the sense 
of Klein—Voronoi (see Fig. 25.2 below). We denote it by KVCF(A). 


In algebraic language we write the factor sail as 
S;(A) = A (conv ({q € m4 | T4(q) NC(A) NZ" 4 0} \ {O})), 


and hence 


ok 
S(A)= (J T(p) and — KVCF(A) =(JS;(A). 
pes;(A) i=l 


Definition 25.20. The intersection of a factor-sail with a hyperplane in 7 is said 
to be an m-dimensional face of the factor-sail if it is homeomorphic to the m- 
dimensional disk. 

The union of all orbits in R” represented by points in some face of the factor-sail is 
called the orbit-face of the operator A. 

Integer points of the sail are said to be vertices of this sail. 


In the simplest possible cases of k+/ = 1, every factor-sail of A is a point. If 


k+l > 1, then every factor-sail of A is an infinite polyhedral surface homeomorphic 
to RK+/-1. 
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25.3.1.2 Algebraic continued fractions 


Consider now an operator A in the group GL(n, Z) whose characteristic polynomial 
is irreducible over Q. Suppose that it has k real roots and 2/ complex conjugate 
roots, where k+ 21 =n. 

Further, we consider the Dirichlet group © (A) of all GL(n, Z) operators commut- 
ing with A and the corresponding positive Dirichlet group =+(A) (for more details 
see Chapter 21). The Dirichlet group = (A) takes a Klein—Voronoi continued frac- 
tion to itself and permutes the sails. The positive Dirichlet group 2, (A) consists 
exactly of operators preserving every sail. By the Dirichlet unit theorem, the group 
=(A) is homomorphic to Z‘*!~! & G, where G is a finite abelian group. The group 
=, (A) is homeomorphic to Z+!—! and its action on any sail is free. The quotient of 
any sail by the action of =,(A) is homeomorphic to the (n—1)-dimensional torus. 


Definition 25.21. A fundamental domain of KVCF (A) is a collection of orbit-faces 
of KVCF (A) representing exactly one of the different classes of KVCF (A)/3(A). 
A fundamental domain of a sail S;(A) is a collection of orbit-faces of S;(A) repre- 
senting exactly one of the different classes of S;(A)/24(A). 


Example 25.22. Let us study an operator A with a Frobenius matrix 


001 
101 
013 


This operator has one real and two complex conjugate eigenvalues. Therefore, the 
cone 7, for A is a two-dimensional half-plane. In Fig. 25.2a the half-plane 7+ is 
shaded in light gray and the invariant plane corresponding to the pair of complex 
eigenvectors is in dark gray. The vector shown in Fig. 25.2a with endpoint at the 
origin is an eigenvector of A. 

In Fig. 25.2b we show the cone 71. The invariant plane separates 74 into two 
parts. The dots on 7, are the 7-integer points. The boundaries of the convex hulls 
in each part of 7, are two factor-sails. One factor-sail is taken to another by the 
induced action of —Id. 

Finally, in Fig. 25.2c we show one of the sails. Three orbit-vertices shown in the 
figure correspond to the vectors (1,0,0), (0,1,0), and (0,0, 1): the large dark points 
(0, 1,0) and (0,0, 1) are visible at the corresponding orbit-vertices. 

The positive Dirichlet group 2 (A) in our example is homeomorphic to Z, and 
is generated by A. The group =(A) is homeomorphic to Z@ Z/2Z with generators 
A and —Id. The operator A takes the point (1,0,0) and its orbit-vertex to the point 
(0, 1,0) and the corresponding orbit-vertex. Therefore, every fundamental domain 
of the continued fraction for the operator A contains one orbit-vertex and one ver- 
tex edge. For instance, we can choose the orbit-vertex corresponding to the point 
(1,0,0) and the orbit-edge corresponding to the “tube” connecting orbit-vectors for 
the points (1,0,0) and (0, 1,0). 
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Fig. 25.2 A three-dimensional example (Example 25.22): (a) the cone 7, and the eigenplane; (b) 
the continued factor-fraction; (c) a sail of the continued fraction. 


25.3.2 Geometric complete invariants of Dirichlet groups 


One of the main properties of Klein—Voronoi continued fractions is that they clas- 
sify all Dirichlet groups. 


Theorem 25.23. Let A,B € GL(n,Z) have characteristic polynomials irreducible 
over Q. Then © (A) = =(B) if and only if KVCF (A) = KVCF(B). 


Remark. If the characteristic polynomial of a matrix is irreducible over Q, then all 
its eigenvectors are distinct, so all matrices of Proposition 25.23 possess Klein— 
Voronoi continued fractions. 
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Proof. Supposing that = (A) = =(B) then A and B commute. Hence they have the 
same eigenvectors (since they do not have multiple eigenvalues) and the same orbits: 
T, = Tp. In addition, we choose the same cone 7 for both A and B. Therefore, by 
definition, the Klein—Voronoi continued fractions for A and B coincide. 


Let us prove the converse statement. Assume that the Klein—Voronoi continued 
fractions for A and B coincide. Suppose that A has real eigenvectors g1,..., 9, and 
complex conjugate eigenvectors gz42;-1 + V—I¢n42 j for j = 1,...,/, where k+ 
21 =n. Consider the coordinate system corresponding to the basis {g;}: 


OX,X2...X4Y1Z,YoZo... YZ. 


In these coordinates define the form ®, as follows: 


ae 
@,(x1,...,%n) = (Ts [105+ )) 
eA 


J=1 


Similarly we define the form  , for the operator B. Note that A preserves the form 
®, up to a multiplicative scalar: a simple calculation shows that for every v € R” 
we have 

@,(A(v)) = det(A)@y(v). 


The same is true for B and Bz. 

The Klein—Voronoi continued fraction for A (which is also KVCF (B)) asymp- 
totically coincides with the set 4 = 0 (and @g = 0 respectively) at infinity, and 
hence the matrices A and B have all the same invariant subspaces. In particular, 
their one-dimensional real eigenspaces corresponding to real eigenvectors and their 
two-dimensional eigenspaces (we denote them by 7, ..., 7) defined by pairs of 
complex conjugate roots coincide. This implies that A and B commute if and only if 
they commute for the vectors of the invariant planes 7, ..., 7. 


Let us show that operators A and B restricted to a plane 7; (i= 1,...,/) commute. 
For a vertex v of KVCF (A) we set 


T, = KVCF(A)O(v+ (m,.--,)). 


From construction, 7, coincides with the orbit T,(v). Since the space spanned by 
all planes 7,..., 7) is invariant for both A and B, the set 7, coincides with the orbit 
Tg(v), and hence Ty(v) = Tp(v). 

Consider an arbitrary matrix C of the set T4. Since all eigenvalues of C have unit 
modulus and C is diagonalizable in the eigenbasis of A, C preserves ®, up to a 
scalar det(C) = +1. Therefore, |@,| is constant on 7,4 (v). For the same reason, | ®g| 
is constant on Ty (v) = Tp(v). Therefore, by linearity, ®, = c- @g for some constant 
c #0. Hence, A preserves ®g up to a multiplicative scalar. 

Consider now the plane 2; for some 1 < j </ and take coordinates (y;,z;). It 
is clear that if an operator preserves ®g(v) up to a multiplicative scalar, then its 
restriction to the plane 7; preserves the form 
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2 2 
Vit Sj 


up to a multiplicative factor. There are two types of matrices that preserve the set of 


level sets of this form: 
ab aad —ab 
—ba ba 


with arbitrary real parameters a and b. The matrices of the second family have two 
real eigenvalues in 7;, which is by the above not the case for B. Therefore, both A 
and B are from the first family. All matrices of the first family commute. Hence A 
commutes with B in planes 7; forl1 <j </. 

Therefore, A and B are both diagonalizable in a certain complex basis. Hence A 
and B commute. Therefore, = (A) = =(B). 


25.3.3 Geometric invariants of conjugacy classes 


In this subsection we fix some integer basis in R” and identify operators with matri- 
ces in this basis. On the one hand, from Theorem 25.23 it follows that the Klein— 
Voronoi continued fraction for A identifies the Dirichlet group = (A) in a unique way. 
On the other hand, the operator A defines a shift of KVCF(A) along itself, which 
we denote by Py. It is clear that distinct operators of = (A) define inequivalent shifts. 
So every A € GL(n, Z) is uniquely identified with a pair (KVCF (A), P4). The group 
GL(n,Z) naturally acts on pairs (KVCF (A), P4) by left multiplication on the first 
factor and by conjugation on the second factor. We have the following important 
tautological theorem. 


Theorem 25.24. (On geometric complete invariant.) Two matrices A, ,A2 of the 
group GL(n,Z) whose characteristic polynomial is irreducible over Q are integer 
conjugate if and only if the pairs (KVCF (A,),Pa,) and (KVCF (Az), Pa, ) are in the 
same GL(n, Z)-orbit. 


Remark 25.25. The important consequence of this theorem is that all geometric 
GL(n, Z)-invariants of the Klein—Voronoi continued fraction for A (such as integer 
distances and integer volumes of certain integer point configurations of KVCF(A)) 
are invariants of the conjugacy class of A. Fundamental domains of Klein con- 
tinued fractions and their invariants in the real spectrum three-dimensional case 
(n = k =3,1 =0) were studied in works [130], [129], [134], [136], [96], [97], etc. 
Currently there is not much known about the construction of Klein—Voronoi con- 
tinued fractions in the nonreal spectrum case. 
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25.4 Algorithmic aspects of reduction to ¢-reduced matrices 


In Section 25.2 above we showed the existence and finiteness of ¢-reduced matri- 
ces in each integer conjugacy class of SL(n,Z) matrices. The aim of this section 
is to introduce techniques to construct ¢-reduced matrices integer conjugate to a 
given one. In Section 25.4.1 we give a geometric interpretation of Hessenberg com- 
plexity as a volume of a certain simplex, which is called the MD-characteristic. 
Further, in Section 25.4.2 we use the MD-characteristics to prove that the algorithm 
of Section 25.2.2 constructs all ¢-reduced matrices starting from integer vertices of 
a Klein—Voronoi continued fraction. The corresponding techniques are outlined in 
Section 25.4.3. 


25.4.1 Markov—Davenport characteristics 


In this subsection we characterize the Hessenberg complexity in terms of the 
Markov—Davenport characteristic. 


25.4.1.1 Definition of the MD-characteristic and its invariance under the 
action of the Dirichlet group 


The study of Markov—Davenport characteristics is closely related to the theory 
of minima of absolute values of homogeneous forms with integer coefficients in 
n-variables of degree n. One of the first works in this area was written by A. Mar- 
kov [146] for decomposable quadratic forms in two variables. Further, H. Daven- 
port, in a series of works [52], [53], [54], [55], and [56], made the first steps for the 
case of decomposable forms for n = 3. 


Consider A € SL(n, Z). Denote by P(A,v) the parallelepiped spanned by vectors 
v, A(v), ...,A7-1(v), Le, 


OSA 11=0,....0-1}, 


P(A,v) = {o+¥ 2a‘) 


where O is the origin. 
Definition 25.26. The Markov—Davenport characteristic (or the MD-characteristic, 
for short) of an SL(n, Z) operator A is the functional 

A,:R" >R defined by Aa(v) = V(P(A,v)), 


where V(P(A,v)) is the nonoriented volume of P(A,v). 


Proposition 25.27. Consider A € SL(n,Z) and let B € (A). Then for an arbitrary 
point v we have 
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Aa(v) = Aa(B(v)). 


Remark. Proposition 25.27 implies that the MD-characteristic naturally defines a 
function over the set of all orbits of the Dirichlet group. 


Proof. Since B € 3(A), we have the equality A”B(v) = BA"(v). Hence the paral- 
lelepiped P(A, B(v)) coincides with B(P(A,v)). Since B € SL(n,Z), the volume of 
the parallelepiped is preserved. Therefore, 


Aa(v) = Ag(B(v)). 


25.4.1.2 Homogeneous forms associated to SL(n, Z) operators 


Consider any SL(n,Z) operator A. Suppose that it has k real eigenvectors g1,...,2% 
with eigenvalues r,...,7;, and 2/ complex eigenvectors gx+2j~1 V—len+2 ; with 
complex conjugate eigenvalues c,,C1,...,c7,€;, where k + 21 =n. Consider the sys- 
tem of coordinates 


OX, X2... XYZ, VoL... YZ) 
corresponding to the basis {g;}. A form 
l 
«(Ts[]o3+9) 
j=l 


with nonzero @ is said to be associated to the operator A. 


Proposition 25.28. Let A be an SL(n, Z) operator whose characteristic polynomial 
has distinct roots. Then the MD-characteristic of A coincides with the absolute value 
of a form associated to A for a certain nonzero Q. 


Proof. Let us consider the formulas of the MD-characteristic of A in the eigenbasis. 


We assume that the coordinates in this eigenbasis are (t),...,t,). Then for any vector 
v = (t,...,fn) we have 
j Jj Pee ad J ad 
Al (X) = (rth, +s Wath Cyber Cite, ++ Cpthy 211,67 tea21). 
Therefore, 


Aa(t,-. 


k I 
rae “iT 10} +2) 


k 
al] [ti II th42j-1h+2)) 
i=l 


j=l 


Simple calculations show that a 4 0. 
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25.4.1.3, Hessenberg complexity in terms of the MD-characteristic 


Proposition 25.29. Consider an operator A with Hessenberg matrix M = (aj,;) in 
some integer basis {@;}. The Hessenberg complexity ¢(M) equals the value of the 
MD-characteristic A, (é1). 


Proof. Denote by V; the plane spanned by vectors v,A(v),A7(v),...,A*~!(v). 
Let us inductively show that 
k 
A*(é1) = [Jain Exst +V~, Where vg € Vp. 
i=! 


Base of induction. We have A(é,) = a1, 2€2 +41 141. 
Step of induction. Suppose that the statement holds for k = m, i.e., 


m 
A™(é1) — (Fes Em+1 +Vn, and Vn €& Vin 
i=1 


Let us prove the statement for m+1. Since M is Hessenberg, A(vj,) is in Vin4t. 
Therefore, we have 


m 
Am™1(6))=A (fais) én) +A(Vm) 
m+1 - m 
= ( I aii) Emt1 + (40m) ale (ti 7) (A(t) dina) ; 


The second summand in the last expression is in V,,,1. We have completed the 
induction step. 


Therefore, 


n—1 
Aa(@1) = [J laivtal” = $(M). 
=I 


In the proof of the next theorem we use the following corollary. 


Corollary 25.30. Consider an operator A with matrix M. Let v be any primitive 
integer vector. Consider the matrix (M|v) constructed by the algorithm of Sec- 
tion 25.2.2. Then we have 


¢(M|v) = Aa(v). 
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25.4.2 Klein—Voronoi continued fractions and minima of 
MD-characteristics 


In the following theorem we use Klein—Voronoi continued fractions to find minima 
of MD-characteristics. 


Theorem 25.31. Let A be an SL(n,Z) operator with matrix M in some integer 
basis. Suppose that the characteristic polynomial of A is irreducible over Q. Let 
U be a fundamental domain of the Klein—Voronoi continued fractions for A (see 
Definition 25.21). Then we have: 

(i) For every ¢-reduced matrix M integer conjugate to M, there exists v € U such 
that M = (M|v). 

(ii) Let v € U. The matrix (M|v) is ¢-reduced if and only if the MD-characteristic 
Ag attains its minimal value on the set Z" \ {O} at the point v. 


Proof. Theorem 25.31 (ii) follows directly from Corollary 25.30. 


Let us prove Theorem 25.31 (i). By Proposition 25.28 there exists a nonzero con- 
stant @ such that in the system of coordinates OX,X2...X;Y|Z,¥2Z2 ... YZ) the MD- 
characteristic ®, is expressed as follows: 


[s[]o? +2 


Suppose that the minimal absolute value of ®, on the set of integer points except 
the origin equals mo. 

Consider a cone 7, in the plane z} = --- = z; = 0 and choose the coordinates 
(X1,---;Xk,)1,---,)7) in it. Consider a projection of IR” to the cone along the orbits 
Ta(*). Since the MD-characteristic is constant on T,(u) for every u, the projection 
of the MD-characteristic is well defined. Denote it by ®,. In the chosen coordinates 
of 7,, the function ®, is written as follows: 


k l 
a [Tx [D7 
j=l j=l 


This function is convex in every orthant of the cone 7. Since every factor-sail is the 
boundary of a certain convex hull in each orthant, all minima of the convex function 
@, restricted to the convex hulls are attained at the boundary, i.e., at 2-integer points 
of factor-sails. Therefore, all integer minima of ®4 on Z”" \ {O} are at vertices of 
the Klein—Voronoi continued fraction. 

By Corollary 25.30, the Hessenberg complexity ¢(M|v) coincides with the MD- 
characteristic A4(v). Since every matrix integer conjugate to M has a presentation 
in the form (M]v) and all integer minima of the MD-characteristic are attained at 
vertices of the Klein—Voronoi continued fraction, every ¢-reduced operator M is 
represented as (M|vo) for some vertex vo € KVCF (A). By Corollary 25.7, for every 
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B € &(A) we have 
(M|B(vo)) = (M|vo). 


Hence a vector vo can be chosen in the fundamental domain U. This concludes the 
proof. 


Remark 25.32. In Example 25.5 the ¢-reduced Hessenberg matrices 


012 023 
M,;={|100 and Mo=|111 
035 034 


are integer conjugate but do not coincide. The reason for this is as follows. Con- 
sider the Klein—Voronoi continued fraction of A with matrix Mj). It contains in- 
teger vertices p; = (1,0,0) and pz = (0,1,—1). It turns out that p; and po are 
not in the same orbit of the Dirichlet group but have the same MD-characteristic, 
namely 3. Hence we get two distinct integer conjugate ¢-reduced Hessenberg ma- 
trices: My = (Mj|(1,0,0)) and Mz = (M,|(0, 1, —1)). 


25.4.3 Construction of ¢-reduced matrices by Klein—Voronoi 
continued fractions 


As we have already proved, the ¢-reduced Hessenberg matrix for the operator A 
is constructed starting from some vertex in a fundamental domain of the Klein— 
Voronoi multidimensional continued fraction. We use this property to find all ¢- 
reduced Hessenberg matrices in a given integer conjugacy class of matrices. 


Algorithm (techniques) to find ¢-reduced matrices in integer conjugacy 
classes. 


Input data. An integer matrix M for A. 
Goal of the algorithm. To construct all ¢-reduced matrices integer conjugate to M. 


Step 1. Find a fundamental domain of the Klein—Voronoi continued fraction for 
the operator A (see Remark 25.33 below). 

Step 2. Take all vertices of the fundamental domain constructed in Step 1 and 
find among them all vertices with minimal value of the MD-characteristic (say 
V1y--- Vk). 

Step 3. By Theorem 25.31(i) and (ii) all ¢-reduced matrices integer conjugate to 
M are (M|v1),...,(M|v,). They are all constructed by the algorithm described in 
Section 25.2.2. 


Output. A list of all ¢-reduced matrices integer conjugate to M. 
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Remark 25.33. Currently, Step 1 is the most complicated. The real spectrum case of 
matrices with all eigenvalues being real has been well studied. For the algorithms of 
constructing multidimensional continued fractions in this case, see Section 24 and 
the papers by R. Okazaki [166], T. Shintani [200], J.-O. Moussafir [156], and the au- 
thor [103]. E. Korkina in [130] and [129], G. Lachaud in [134], [136], A.D. Bruno 
and V.I. Parusnikov in [31], [174], [175], and [176], and the author in [96] and [97] 
have produced a large number of fundamental domains for periodic algebraic two- 
dimensional continued fractions (see also the site [29] by K. Briggs). Some funda- 
mental domains in the three-dimensional case are found, for instance, in [99]. The 
case with complex conjugate eigenvalues which we study in the next section, is new. 


Example 25.34. Let us consider the example of the operator A defined by the matrix 


a eer. eee) 
i, 2-2 
= ee 


The characteristic polynomial of this operator has three distinct real roots. There- 
fore, the Klein—Voronoi continued fraction consists of eight sails. The composi- 
tions of operators —Id, A, and 2Id+A7! define integer congruences for all these 
sails, and hence all ¢-reduced operators are written from vertices of one sail. Con- 
sider a sail containing the point (1,0,0). There are exactly three distinct orbits of the 
Dirichlet group containing the vertices in this sail. They are defined by the following 
points: 
(0,0,1), (1,0,0), and (3,—1,1). 


(We skip all calculations of convex hulls corresponding to the sail.) The MD- 
characteristic of these vectors are respectively 1, 2, and 4. So the minimum of the 
MD-characteristic (which is | in this case) is attained on the vertices of the orbit of 
the Dirichlet group containing (0,0, 1). Therefore, there exists a unique ¢-reduced 
Hessenberg matrix, which is 

00 1 

10 1 

01-3 


The perfect Hessenberg matrices for the vertices (1,0,0) and (3,—1,1) are respec- 
tively 


el £01 
100] and [203 ], 
02-3 01-4 


and their ¢-complexities are 2 and 4. 
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25.5 Diophantine equations related to the Markov—Davenport 
characteristic 


Let A be an arbitrary SL(n,Z) operator whose characteristic polynomial is irre- 
ducible over Q, and let N be an arbitrary integer. In this section we discuss how to 
solve the Diophantine equation 


Aa(v) =N. (25.1) 


We begin with a general definition of w-continued fractions. 


25.5.1 Multidimensional w-sails and w-continued fractions 


We have already introduced the w-sails of continued fractions in the one-dimensional 
case in Section 14.2.5. Let us generalize this notion to the multidimensional case. 


25.5.1.1 Definition 


Consider an operator A in SL(n,Z) whose characteristic polynomial is irreducible 
over Q. As above, we suppose that A has k real eigenvalues and / pairs of com- 
plex conjugate eigenvalues, where k + 2/ =n. Recall that the complement to the 
union of all invariant hyperplanes in the cone 7; consists of 2 arewise connected 
components denoted by C(A),...,C x (A). 


Definition 25.35. We define the n-sails for an arbitrary simplicial cone C with vertex 
at the origin inductively. 

— let the /-sail be the sail of C. 

— suppose that all w-sails for w < wo are defined. The convex hull of all z- 
integer points except the origin and except for the z-integer points for all w-factor- 
sails with w < wo contained in an arcwise connected component C;(A) is called the 
wo-factor-sail of C;(A). We denote it by Sy. :(A). 


Let us write the wo-factor-sail of C; in one formula: 


A 


wo-l 
Sqa(A) = (conv ( ({4 € ms | Ta(g)C(A)NZ" £0}\ {0})\ LJ Syi(A))). 
w=1 


The set of all w-factor-sails for a given operator is called the w-factor-continued 
fraction for the operator A. 


Definition 25.36. The union of all orbits T4(*) in R” represented by the points in 
the w-factor-sail of C;(A) is called the w-sail of T4(C;); we denote it by Sy, ;(A). 
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The union of all sails is said to be the w-continued fraction for the operator A in 
the sense of Klein—Voronoi. We denote it by KVCF,,(A). 


In other words, we have 


Swi= (J Ta(p) and KVCF,( = Usui 
peSy(A) 


25.5.1.2 Minima of Markov—Davenport characteristics on w-sails 


Denote by @,, the minimal absolute value of the Markov—Davenport characteristic 
|A,| on KVCF,(A). 


Proposition 25.37. The sequence 0), Q2,03,... is strictly increasing. 


Remark 25.38. The w-sails for arbitrary w > 0 are not necessarily homothetic to the 
1-sail, as it was in the one-dimensional case of Section 14.2.5. 


Proof of Proposition 25.37. Consider the coordinate system 
OX, X2... XYZ, VoL... YZ) 


introduced in Section 25.3.1.1 above. In these coordinates the form Ay, is written as 


follows: 
l 
Ay (Migacogh 1X o=e([fsT]oo+3 )) 
1 
for some nonzero constant c. Consider an arbitrary ray r with vertex at the ori- 
gin and not contained in any invariant subspace of A. Assume that its direction is 
(Ai,.-.,;An). Then at points of this ray the function Ay, is as follows: 


k l 
Aa(Ait, see Ant) =c (TT ae a) t”. (25.2) 
i=1 


j=l 


It is clear that the ray r intersects the w-sail first and only thereafter intersects the 
(w+ 1)-sail. Notice that the expression on the right side of Equation 25.2 is propor- 
tional to t” (with respect to f), which increases as t increases. Therefore, the value 
of Ay, at the point of intersection of the ray r with the (w + 1)-sail is strictly greater 
than the value at the point of intersection with the corresponding w-sail. 

Notice that every possible limit value of A, on the (w+ 1)-sail is attained at 
some point of its fundamental domain (since the closure of a fundamental domain 
is compact). In particular, the infimum of |A,4| at the (w + 1)-sail is attained as a 
minimum at some point. Therefore, O41 > Qhy. 


Corollary 25.39. For an arbitrary integer w, Gy > w. 
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Proof. The convexity of a Markov—Davenport form on the intersection of cones Cj 
with the plane 7+ implies that the minimum @/, is attained at an integer vertex of the 
KVCF,,(A). Therefore, 0, is a positive integer for every w > 0. Now the statement 
of the corollary follows directly from Proposition 25.37. 


25.5.2 Solution of Equation 25.1 


The following general techniques helps to write all the solutions of equation (25.1). 
Techniques to solve equation (25.1). 


Input Data: Given a pair (A,N), where A is a GL(n, Z) matrix whose characteristic 
polynomial is irreducible over Q, and N is an integer. 

Goal of the algorithm. Find all integer solutions of A4(v) = N. 

Step 1. Write the MD-characteristic A, according to Definition 25.26. 

Step 2. Find the fundamental domains of all w-sails C,,,; for the operator A (for all 
admissible i) and w < N; denote their union by U. Notice that UZ” is finite. 
Step 3. Check all integer points of U whether they satisfy the equation 


Aa(v) =N. 


Denote the set of all integer solutions in U by Zy. 
Output Data: the solution of A(v) = N is the set 


L) BZ). 


Bez(A) 


Remark 25.40. By Corollary 25.39 it is enough to check only the vertices of the 
w-sails for w < N. 


25.6 On reduced matrices in SL(3, Z) with two complex 
conjugate eigenvalues 


In this section we study the case of Hessenberg SL(3,Z) matrices with two complex 
conjugate and one real eigenvalue. It turns out that the majority of such matrices 
of the same hessenberg type are ¢-reduced. We start in Section 25.6.1 with some 
notation and definitions. In Section 25.6.2 we formulate a supplementary theorem 
on the parabolic structure of the set of nonreal spectrum matrices, whose proof we 
give in Section 25.6.5. Further, in Section 25.6.3 we formulate the main result on 
asymptotic uniqueness of ¢-reduced matrices, whose proof is in Section 25.6.6. In 
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Section 25.6.4 we examine some particular examples of families of matrices with 
fixed Hessenberg type. All the results of this section are new. 


25.6.1 Perfect Hessenberg matrices of a given Hessenberg type 


For simplicity, we study only SL(3,Z) matrices whose characteristic polynomials 
are irreducible over Q. There are two main geometrically essentially different cases 
of such matrices: the real spectrum matrices (or RS-matrices for short), whose char- 
acteristic polynomials have only real eigenvalues, and the nonreal spectrum ma- 
trices (or NRS-matrices) whose characteristic polynomials have a pair of complex 
conjugate and one real eigenvalue. 

For a Hessenberg type Q we denote the subset of all NRS-matrices in H(Q) by 
NRS(Q). 


Definition 25.41. Let Q = (411,421 |a12,422, 432). Consider v = (413,423,433) such 
that the determinant of the matrix (a;;) equals 1. Define 


11 412 41M + a12N + a43 


Ho (m,n) = | 421 422 421M + a22N + A723 
0 az2 a32N + a33 


It is clear that 
H(Q)= {HQ (m,n)|m eZneZ}. 


In this context, to choose a vector v means to choose the origin O in the plane 
H(Q). So the set H(Q) has the structure of a two-dimensional plane (see also Sec- 
tion 25.2.4 above). We denote by OMN the coordinate system corresponding to the 
parameters (m,n). Notice that the Hessenberg complexity of this family is a4023. 


Let Discr{,(m,n) denote the discriminant of the characteristic polynomial of 
Hg (m,n). Then the set NRS(Q) is defined by the following inequality in variables 
nand m: 

Discrg (m,n) <0. 
Example 25.42. In Fig. 25.3 we show the subset NRS((0, 10,0, 1)). For this exam- 
ple we choose v = (0,0, 1). 


25.6.2 Parabolic structure of the set of NRS-matrices 


The set NRS((0,1|0,0,1)) in Fig. 25.3 “looks like” the intersection of Z? with the 
union of the convex hulls of two parabolas. Let us formalize this in a general state- 
ment. 
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A 
nm 

Ue at x(-1) = 0 

ates 
Diva! 
COPE REC PCP rt m 
a TOTTI rrr rrr rrr t}———-»> 
m=n—2 x(1) =0 


Fig. 25.3 The family of matrices of Hessenberg type (0, 1|0,0, 1). 


Consider the matrix HZ, (0,0) = (ajj) and define b;, bz, and b3 such that the 
characteristic polynomial of this matrix in the variable ¢ is 


—1 +b t? —bot + bs. 


Even though in the case of SL(3,Z) we have b3 = 1, nevertheless we write b3 for 
generality reasons. For the family H{ (m,n) we define the following two quadratic 
functions: 


P1,a(m,n) =m— ayn? — Bin— N: 
i agim — ay\n\2 ayjm— ayn 
ae an( ) Bo( ) 1B 


a) a1 a) 
where 
o 432 Gj O02 
,=- 2= 
4ar’ 4b; ” 
B a1 — 422 — a33 B by 
es SEES ee pa 
2431 2b3’ 
Aby — bi b3 — 4b b3 
n= aa » = ——__.. 
21432 4a21a32b3 


In the real plane OMN of the family HO (m,n) we consider the interior of the 
circle of radius R with center at the origin (0,0) and denote it by Br(O). For a real 
number ¢ we set 
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A; = {(m,n) | (pi.a(m,n) —t)(p2,a(m,n) —t) < Of. 


Theorem 25.43. For every positive € there exists R > 0 such that the following 
inclusions hold: 


Ae\Br(O) C NRS(Q)\Br(O) C A-e\Br(O). 


We give the proof of this theorem in Section 25.6.5 below. 


25.6.3 Theorem on asymptotic uniqueness of ¢-reduced 
NRS-matrices 


Recall that a ray is said to be integer if its vertex is integer and it contains integer 
points distinct from the vertex. 


Definition 25.44. An integer ray in H(Q) is said to be an NRS-ray if all its integer 
points correspond to NRS-matrices. A direction is said to be asymptotic for the set 
NRS(Q) if there exists an NRS-ray with this direction. 


As stated in Theorem 25.43, for every Hessenberg type Q the set NRS(Q) almost 
coincides with the union of the convex hulls of two parabolas. This implies the 
following statement. 


Proposition 25.45. Let Q = (a11,a21|@12,422,a32). There are exactly two asymp- 
totic directions for the set NRS(Q), defined by the vectors (—1,0) and (ai1,a21). 


Consider a family of Hessenberg matrices HQ for an appropriate integer vector 
v. 


Definition 25.46. Define 
R™ 
mia” = {H6(m-t,n) 
Ry0y = ar eet (m+ait, ntanit , Z>0}. 


By Rio, (t), or respectively by R5"o ,(t), we denote the ¢th element in the corre- 
sponding family. 


Remark 25.47. The families Ry Oy and R5"o , coincide with the sets of all integer 
points of certain rays with directions (—1,0) and (a1;,a1) respectively. Conversely, 
from Proposition 25.45 it follows that the set of integer points of every NRS-ray 
coincides either with Roy or with Ry Oy for some integers m and n. 


In Fig. 25.4 we show in dark gray two NRS-rays: 


—9,5 . 
— Ry 311,1,3),(0,0,-1) from the left; 


— Ry 1.211,1,3),(0,0 -1) from the right. 


378 25 Gauss Reduction in Higher Dimensions 


> CLOIIIIIIIIIIITit 


/ 


v 


Fig. 25.4 Every NRS-ray contains finitely many ¢-nonreduced matrices. 


Now we are ready to formulate the main result on asymptotic behavior of NRS- 
matrices, which we prove later in Section 25.6.6. 


Theorem 25.48. (On asymptotic ¢-reducibility and uniqueness.) (i) Every NRS- 
ray (as in Fig. 25.4) contains only finitely many ¢-nonreduced matrices. 

(ii) Every NRS-ray contains only finitely many matrices that are integer conjugate 
to some other ¢-reduced matrix. 


Example 25.49. Every NRS-ray for the Hessenberg type (0, 1|0,0, 1) contains only 
¢-reduced perfect matrices. Experiments show that every NRS-ray for (0, 1|1,0,2) 
contains at most one ¢-nonreduced matrix (see Fig. 25.5). 
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mt+n=-—1 x(-1) =0 
ml T] 
a 
i] 
m 
[-}—_—_—_—_» 
(NS x(1) = 0 


Fig. 25.5 The family of Hessenberg matrices H i 11,0 (m,n). 


25.6.4 Examples of NRS-matrices for a given Hessenberg type 


In this subsection we study several examples of families NRS(Q) for the Hessenberg 
types: 


(0,1]0,0,1), (0,1]1,0,2), (0,1]1,1,2), and (1,2]1,1,3). 


In Figs. 25.5,. 25.6, and 25.7 the dark gray squares correspond to ¢-nonreduced 
matrices. We also fill with gray the squares corresponding to ¢-reduced Hessenberg 
matrices that are the nth powers (n > 2) of some integer matrices. 

Hessenberg perfect NRS-matrices 1 i wert (m,n). The Hessenberg complexity 
of all these matrices is 1, and therefore, they are all c-reduced (see the family in 
Fig. 25.3 on page 376). 

Hessenberg Perfect NRS-Matrices reise it, roam 
ity of these matrices equals 2. Experiments show that 12 such matrices are ¢c- 
nonreduced (see the family in Fig. 25.5). It is conjectured that all other Hessenberg 
matrices of NRS((0, 1|1,0,2)) are ¢-reduced. 


n). The Hessenberg complex- 


Hessenberg Perfect NRS-Matrices H ei it, 0 ay (m m,n). The Hessenberg complexity 
of these matrices equals 2. We have found 12 ¢-nonreduced matrices in the family. 
It is conjectured that all other Hessenberg matrices of NRS((0,1|1,1,2)) are ¢- 
reduced. See Fig. 25.6. 
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n 
mt+n=-—-1 x(-1) =0 
m 

[-}—_—_—_—_» 
i x(1) = 0 


Fig. 25.6 The family of Hessenberg matrices H oe ) 2) (m,n). 


fs x(-1) =0 


m—n =2 


Fig. 25.7 The family of Hessenberg matrices H, ee ; ay 


Hessenberg Perfect NRS-Matrices H, Sore 4 - (m,n). This is a more complicated 
example of a family of Hessenberg perfect NRS-matrices, with their complexity 
equaling 12. We have found 27 ¢-nonreduced matrices in the family. It is conjec- 
tured that all other Hessenberg matrices of NRS((1,2|1,1,3)) are ¢-reduced. See 


Fig. 25.7. 
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25.6.5 Proof of Theorem 25.43 


We begin the proof with several lemmas, but first a small remark. 


Remark. The set NRS(Q) is defined by the inequality 
Discry, (m,n) <0. 


In the left part of the inequality there is a polynomial of degree 4 in variables m and 
n. Notice that the product 


1643, 439b3 (Pi,0 (m,n)p2.0 (m,n)) 
is a good approximation to Discr¢ (m,n) at infinity; i.e., the polynomial 
Discr’, (m,n) — 16a3,a3b3 (p1,0(m,n)p2,0(m,n)) 
is a polynomial of degree only 2 in the variables m and n. 


Lemma 25.50. The planar curve Diser aio 1 (n,n) = 0 is contained in the do- 
main defined by the inequalities — 


(m? —4n+3)(n? +4m+3) > 0, 
(m? —4n —3)(n? + 4m —3)—72 <0. 


Remark. Lemma 25.50 implies that the curve Diser{4.\io 0,1) (7) = 0 is contained 
in some tubular neighborhood of the curve 


(m? —4n)(n* +4m) =0. 


Proof. Note that 
«_(1,0,0 
Disc oi (m,n) = (m* — 4n)(n? + 4m) —2mn — 27. 
Thus, we have 


DSS ico (m,n) — (m? —4n +3)(n? +4m+3) 


=-2(n 3)2 2(m+3)?—(n+m)? <0, 


and 
Dist aot (m,n) — (m* —4n—3)(n? +4m—3) +72 


= 2(n—3)? +2(m+3)? +(n—m)? > 0. 


Therefore, the curve Diserigiing 1) (m,n) = 0 is contained in the domain defined in 
the lemma. 


382 25 Gauss Reduction in Higher Dimensions 


Lemma 25.51. For every Q = (a41,421|@12, 422,432) there exists an affine (not nec- 
essarily integer) transformation of the plane OMN taking the curve DiscrQ (m,n) = 


0 to the curve Disery io oa) (m,n) =0. 


Proof. Let HQ (0,0) = (a;,;). Note that every matrix H{ (m,n) is rational conjugate 
to the matrix 


Oo Get 
Ao 1)0,0,1) yn F 


where 


/ 
M = 73032 — 411433 + 412421 — 422433 — 441422 + 421d32M — a11a32n, 
n! = ay +472 +433 +.a30n, 


by the matrix 
Lay aj, +aa21 
XG = | 0 aa ayiaz1 +.a21a22 
0 0 a21a32 


0,0 


my 


Therefore, the curve Discry (m,n) = 0 is mapped to the curve Dist eats (m,n) = 
0 biectively. In OMN coordinates this map corresponds to the following affine 
transformation: 


m)\ _, (421932 —ariagon \ 47332 —A 11433 +4 12421 —A22433—A 422 
n a32n a1 +422 +433 ; 


This completes the proof of the lemma. 


Proof of Theorem 25.43. Consider a family of matrices H?,(—p1,q(0,t) +€,t) with 
real parameter t. Direct calculations show that for € 4 0 the discriminant of the 
matrices for this family is a polynomial of the fourth degree in the variable ft, and 


: 1 
Discrg (—pi,a(0,t) +€,t) = garde + O(t3). 


Therefore, there exists a neighborhood of infinity with respect to the variable t such 
that the function Discry,(—p1,a(0,t) + €,t) is positive for positive € in this neigh- 
borhood, and negative for negative €. 

Hence for a given € there exists a sufficiently large Nj = N)(€) such that for 
every t > N, there exists a solution of the equation Discry (m,n) = 0 at the segment 
with endpoints 


(— pia (0,t) +¢,t) and (— p1,0(0,t) —€,t) 


of the plane OMN. 


Now we examine the family in the variable t for the second parabola: 


HY’, | t—a11 p20 (t,0)— “=e, —ar1 pr. a (t,0 - sire). 
2 (1-eupsalt.0)- ritire—eaiprat.0)~ 7p 
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For the same reasons, for a given € there exists a sufficiently large Nz = N2(€) 
such that for every t > No there exists a solution of the equation Discr{ (m,n) = 0 
on the segment with endpoints 


11 
= o i@11 gp 1 
(t 411P2,Q (t,0)+ peer a421P2,Q (t,0)+ aE!) 


of the plane OMN. 
The set 


(t—a11p2,0 Ue prem —421P2,0 ¢,0)— iz) and 
a 


P1,Q (m,n) p2,0 (m,n) =0 


is a union of two parabolas and therefore has four parabolic branches “approaching 
infinity.” We have shown that for each of these four parabolic branches there exists a 
branch of the curve Discr{, (m,n) =0 contained in the €-tube of the chosen parabolic 
branch if we are far enough from the origin. 


From Lemma 25.50 we know that Diseriy sig 1) (m,n) = 0 is contained in some 
tubular neighborhood of 


P1,(0,10,0,1) (m,1) P2,(0,1\0,0,1) (m7) = 0. 


Then by Lemma 25.51, the curve Discr{ (m,n) = 0 is contained in some tubular 
neighborhood of the curve 


P1,Q (m,n) p2,0 (m,n) =0 


outside some ball centered at the origin. Finally, by Viéte’s Theorem, the intersection 
of the curve Discr’, (m,n) = 0 with each of the parallel lines 


ayy +421 
Lp: =n 


—-m=t 
a2) 
contains at most four points. Therefore, there exists a sufficiently large T such that 
for any t > T, the intersection of the curve Discrg (m,n) =0 and @, contains ex- 
actly 4 points corresponding to the branches of the parabolas p;,9(m,n) = 0 and 
P2,9(m,n) = 0 lying in A_¢ \ Ag. 
From Lemma 25.50 and Lemma 25.51 it follows that the set 


{aura 
= eh 


—m< r} A {Discr), (m,n) = 0} 
a2) 


is compact. 
Therefore, there exists R = R(€,N1,N2,T) such that in the complement to the 
ball Br(O) we have 
Ag C NRS(Q) C A_¢. 


The proof of Theorem 25.43 is complete. 
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25.6.6 Proof of Theorem 25.48 


First we prove that a fundamental domain of a general Klein—Voronoi continued 
fraction can be chosen from a certain bounded subset of R*. Then we describe 
asymptotically the geometric structure of Klein—Voronoi continued fractions. Fi- 
nally, we prove Theorem 25.48. 


25.6.6.1 One general fact on fundamental domains of Klein—Voronoi 
continued fractions for NRS-matrices of SL(3, Z) 


Consider an NRS-operator A and any integer point p distinct from the origin O. 
Denote by 1 (p) the convex hull of the union of two orbits corresponding to the 
points p and A(p). For every integer k we denote by D*(p) the set A*(I?(p)). 


Proposition 25.52. Let p be a nonzero integer point. Then there exists a funda- 
mental domain of the Klein—Voronoi continued fraction for A with all (integer) 
orbit-vertices contained in the set D°(p). 


The proof is based on the following lemma. Let 


Th(p) = U T(r). 
keZ 


Lemma 25.53. Let p be a nonzero integer point. Then one of the Klein—Voronoi 
sails for A is contained in the set I4(p). 


Proof. Notice that the set I\(p) is a union of orbits. Let us project I4(p) to the 
half-plane 2. The set I4(p) projects to the closure of the complement of the con- 
vex hull for the points I,(A*(p)) 2, for all integer k in the angle defined by the 
eigenspaces. Since all points A*(p) are integer, their convex hull is contained in the 
convex hull of all points corresponding to integer orbits in the angle. Hence the set 
TA(p) 4 contains the projection of the sail. Therefore, the set [4(p) contains one 
of the sails. 


Proof of Proposition 25.52. Since —Id exchanges the sails, one can choose a fun- 
damental domain entirely contained in one sail. Let this sail contain a point p. 
By Lemma 25.53 the set [4(p) contains this sail. Therefore, all orbit-vertices of 
a fundamental domain for the Klein—Voronoi continued fraction can be chosen in 


Ty’ (p). 


25.6.6.2 Geometry of Klein—Voronoi continued fractions for matrices of 
Rn 
1,Q,v 


Let us prove the following statement. 
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Proposition 25.54. Consider an NRS-ray R‘'6 ‘ay Then there exists a constant C > 0 
such that for every t > C, all orbit-vertices Of one of the fundamental domains for 
the Klein—Voronoi continued fraction of the matrix Rio y(t) are contained in the 
set of all integer orbits corresponding to the integer points in the convex hull of 
(1,0,0), (a11,421,0), and (—aj1,—a21,0). 


We begin with the case of matrices of Hessenberg type Qo = (0, 10,0, 1). Such 
matrices form a family H (Qo) with real parameters m and n: 


aes 001 
1 1)0,0,1) (7) ={10m 
Oln 


Put vo = (1, 0) ,0). 


Lemma 25.55. Let Ri'0, yp De an NRS-ray. Then for every € > 0 there exists C > 0 
such that for every t > C. the convex hull of the union of two orbit-vertices 


ae alt 1)(1,0,0) and Tee (t) (0, 1,0) 


is contained in the €-tubular neighborhood of the convex hull of three points (1,0,0), 
(0,1,0), (0,—1,0). 


Remark. Lemma 25.55 means that the corresponding domain tends to be flat as ¢ 
tends to infinity. Recall that R1'5, ,, (t)(1,0,0) = (0, 1,0) for every r. 


Proof. Let us find the asymptotics of eigenvectors and eigenplanes for operators 
I ona (t) as t tends to +co. Denote a real eigenvector of Ri oye (t) by e(t). Direct 
calculations show that there exists u #0 such that 


e(t) =u ((1,0,0) + O(r-')). 


Consider the invariant real plane of the operator R””, 1 “Oo, vp (t ) (it corresponds to a pair 
of complex conjugate eigenvalues). Note that this plane} is a union of a certain family 
of closed orbits of Ry Qo.vo(t ). Direct calculations show that any orbit in this family 
is an ellipse with semiaxes Agmax(t) and Agmin(t) for some positive real number /, 
where 

8max(t) = (0,t,0) +O(1), 

8min(t) = (0,0,1'/7) + O(- 1). 


The vectors gmax(t)+W—Igmin(t) are two complex eigenvectors of the operator 
R10, 25 (t). For the ratio of the lengths of maximal and minimal semiaxes of any 
orbit we have the following asymptotic estimate: 


A|8max(t)| 


= |¢|!/2 + o((|t|- 1/7). 
ease II Cao) 


First, since 
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1 =i 
Co erro ees ), 


the minimal semiaxis of the orbit-vertex Tae = t) (1,0, 0) is asymptotically not 


greater than O(t~!). Therefore, the length of the maximal semiaxis is asymptoti- 
cally not greater than some function of type O(|t|~!/). Hence, the orbit of the point 
(1,0,0) is contained in the (C;|t\~!/2)-ball of the point (1,0,0), where C| is a con- 
stant that does not depend on f. 

Second, we have 


1 = 
(0, 1,0) — =max(¢) = O(\e|""). 
Thus, the length of the maximal semiaxis of the orbit-vertex Tn (r) (1,0,0) is 
:20,%9 


asymptotically not greater than some function 1 + o(t!/ >). Hence, the length of 
the minimal semiaxis is asymptotically not greater than some function O(|t|~!/2). 
This implies that the orbit of the point (0, 1,0) is contained in the (C2|t|~!/2)-tubular 
neighborhood of the segment with vertices (0,1,0) and (0,—1,0), where C2 is a 
constant that does not depend on fr. 

Therefore, the convex hull of the union of two orbit-vertices 


Tynn (10,0) and Tym (0, 1,0) 


is contained in the C-tubular neighborhood of the triangle with vertices (1,0,0), 
(0, 1,0), (0,—1,0), where C = |t|~!/? max(C,,C2). This concludes the proof of the 
lemma. 


Let us now extend the statement of Proposition 25.52 to the general case of Hes- 
senberg matrices. 


Corollary 25.56. Let Q = (aj1,421|412,422,432) and let Res be an NRS-ray. Then 
for every € > 0 there exists C > 0 such that for every t > C the convex hull of the 
union of two orbit-vertices 

Tan (¢)(1,0,0) and Tame (1) (411,421,0) 
is contained in the €-tubular neighborhood of the convex hull of three points (1,0,0), 
(a11,421,0), (—ay1,—a21,0). 


Remark. Recall that RY" , (t)(1,0,0) = (ai1,a21,0) for every t. 


Proof. Define 
421032 —A32411 411422 — a21412 
X= 0 a32 —a\| —a22 
0 0) 1 


The operator X defines two linear functions J; and /2 on two variables m and n with 
coefficients depending only on a), a21, 412, 422, and a32 satisfying 
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H{1000) (11m) - 


(0,1)0,0,1) 


san.) = XHo(m—t,n)X"!. 
a21 432 


Therefore, the ray Ri , after a change of coordinates and a rescaling is taken to the 
iit,fi ats 


1.0o.vy Of matrices with Hessenberg type 29 = (0, 1|0,0, 1) for certain m and 


ray R 

a. 
Lemma 25.55 implies the following. For every € > 0 there exists a positive con- 

stant such that for every ¢ greater than this constant the convex hull of the union of 


two orbit-vertices 


T yn, (1,0,0) and T yi 


ri 0,1,0 
He a) yO as 
is contained in the €-tubular neighborhood of the triangle with vertices (1,0,0), 
(0,1,0), (0,—1,0). 
If we reformulate the last statement for the family of operators in the old coordi- 
nates, then we get the statement of the corollary. 


Proof of Proposition 25.54. Notice that the operator Rj", ,,(t) takes the point (1,0,0 


to the point (a11,a21,0). Therefore, the convex hull of the union of two orbit-vertices 


Tan (y(1,0,0) and Tyna ()(a11,421,0) 


1,Q,v 


(we denote it by W(t)) coincides with the set Tym @) (1,0,0). 
1,Q,v 


From Proposition 25.52 it follows that there exists a fundamental domain for 
a Klein—Voronoi continued fraction with all its orbit-vertices contained in W(t). 
Choose a sufficiently small € such that the €)-tubular neighborhood of the triangle 
with vertices 


(1,0,0), (a1,1,42,1,0), and (—a11,—a21,0) 


does not contain integer points distinct from the points of the triangle. From Corol- 
lary 25.56 it follows that for a sufficiently large ¢ the set W(t) is contained in 
the €9-tubular neighborhood of the triangle. This implies the statement of Propo- 
sition 25.54. 


25.6.6.3 Geometry of Klein—Voronoi continued fractions for matrices of 
m,n 
Ry a 
Now let us study the remaining case of the rays of matrices with asymptotic direc- 
tion (a11,a21). We recall that Qo = (0, 10,0, 1). 


Proposition 25.57. Consider an NRS-ray Ke » Lhen there exists a constant C > 0 
such that for every t > C all orbit-vertices of one of the fundamental domains for the 
Klein—Voronoi continued fraction of the matrix Rot) are contained in the set of 
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all integer orbits corresponding to the integer points in the convex hull of (1,0,0), 
(—1,0,0), and (a11,421,0). 


The proof of this proposition is based on a corollary of the following lemma. 
Lemma 25.58. Let I ai ve be an NRS-ray. Then for every € > 0 there exists C > 0 
such that for every t > C the convex hull of the union of two orbit-vertices 


Trmn (9 10;0) and ae (t) (0, 1,0) 


2,29. 


is contained in the €-tubular neighborhood of the convex hull of three points (1,0,0), 
(—1,0,0), (0, 1,0). 


Proof. First, notice that the Klein—Voronoi continued fractions for the operators A 
and A~! coincide. 


Second ae nee 
Le 1%; -1 
Ay 10,01) (+t) = XH (0 1/9 0,1)(—2-t, —m)X 
where 
0 —l-—n-t 
X={|-10 --m 
0 0 -!l 


Thus, in the new coordinates we obtain the equivalent statement for the ray 
Ry Go. (t). Now Lemma 25.58 follows directly from Lemma 25.55. 


Corollary 25.59. Let Q = (aj1,421|412,422,432) and let Ro oy be an NRS-ray. Then 
for every € > 0 there exists C > 0 such that for every t > C the convex hull of the 
union of two orbit-vertices 


Tr, (m+a,,t,n-+a 1) el 0, 0) and Try, (m+ay,t,n+a21t) (a 11,421, 0) 


is contained in the €-tubular neighborhood of the triangle with vertices (1,0,0), 
(—1,0,0), and (aj1,a21,0). 


Remark. We omit the proofs of Corollary 25.59 and Proposition 25.57, since they 
repeat the proofs of Corollary 25.56 and Proposition 25.54. 


25.6.6.4 Proof of Theorem 25.48 


Step 1. Let A be an operator with Hessenberg matrix M in SL(3,Z). By Proposi- 
tion 25.29 the Hessenberg complexity of the Hessenberg matrix M coincides with 
the MD-characteristic A4(1,0,0). Therefore, the Hessenberg matrix M is ¢-reduced 
if and only if the MD-characteristic of A attains the minimal possible absolute value 
on the integer lattice, except at the origin, exactly at the point (1,0,0). 
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Step 2. By Theorem 25.31 (ii) we know that all minima of the set of absolute values 
for the MD-characteristic of A are attained at integer points of the Klein—Voronoi 
sails for A. 


Step 3. By Theorem 25.31(i) we can restrict the search of the minimal absolute 
value of the MD-characteristic to an arbitrary fundamental domains of the Klein— 
Voronoi continued fraction. 


Step 4.1. The case of NRS-rays with asymptotic direction (—1,0). By Proposi- 
tion 25.54 there exists T > 0 such that for every integer ¢t > T all integer points 
of one of the fundamental domains for Raat) are contained in the convex hull of 
three points (1,0,0), (a11,4@21,0), and (—a11,—a},0). 

This triangle contains only finitely many integer points, all of which have the last 
coordinate equal to zero. The value of the MD-characteristic for the points of type 
(x,y,0) equals 

(arix—a1y)azyy"t +C, 


where the constant C does not depend on f, but only on x, y, and the elements q;_; of 
the matrix HZ (m,n). 

So, for any point (x,y,0), the MD-characteristics linearly increase with re- 
spect to the parameter t. The only exceptions are the points of type 1(1,0,0) and 
L(a11,421,0) (for integers A and 1). The values of the MD-characteristics are con- 
stant in these points with respect to f. 

Since there are finitely many integer points in the triangle (1,0,0), (a11,a21,0), 
and (—a1,—a21,0), for sufficiently large t the MD-characteristic at points of the 
triangle attains its minimum at (1,0,0) or at (ai1,a21,0). Since R'9 ,(t) takes 
the point (1,0,0) to the point (a11,a21,0), the values of the MD-characteristics at 
(1,0,0) and at (a11,a21,0) coincide. 

Therefore, for sufficiently large ¢ the matrix 


(1,0,0) 


m—t,n 
(411,421 uidmaany n) 


is always c-reduced and there are no other ¢-reduced matrices integer conjugate to 
the given one. This implies both statements of Theorem 25.48 for every NRS-ray 
with asymptotic direction (—1,0). 

Step 4.2. The case of NRS-rays with asymptotic direction (a,,a21). This case is 
similar to the case of NRS-rays with asymptotic direction (—1,0), so we omit the 
proof here. 


Proof of Theorem 25.48 is complete. 
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25.7 Open problems 


In this section we formulate open questions on the structure of the sets of NRS- 
matrices and briefly describe the situation for RS-matrices. 


NRS-matrices. As we have shown in Theorem 25.48, the number of ¢-nonreduced 
matrices in any NRS-ray is finite. Here we conjecture a stronger statement. 


Conjecture 31. Let Q be an arbitrary Hessenberg type. All but a finite number of 
NRS-matrices of type Q are ¢-reduced. 


If the answer to this conjecture is positive, we immediately have the following 
general question. 


Problem 32. Study the asymptotics of the number of ¢-nonreduced NRS-matrices 
with respect to the growth of Hessenberg complexity. 


Denote the conjectured number of ¢-nonreduced NRS-matrices of Hessenberg 
type Q by #(Q). Numerous calculations give rise to the following table for all types 
with Hessenberg complexity less than 5. 


Q |], 1]0,0, 1), 1[1,0,2)[(0, 111, 1,2) ]], 111, 0, 3)], 11, 1,3)](, 1]1,2,3) 


¢(Q) 1 2 2 3 3 3 
#(Q) 0 12 12 6 10 10 

Q_ |KO, 1]2,0,3) |(0, 1]2, 1,3) | (0, 1]2,2,3) |], 2]0,0, 1) |] (0, 1]1,0,4)] (0, 11, 1,4) 
¢(Q) 3 3 3 4 4 q 
#(Q) 14 10 10 94 6 8 

Q_|}(0,1[1,2,4)](0, 1]1,3,4)| (0, 1]3,0,4) | (0, 1]3, 1,4) |(0, 113, 2,4) |(0, 113, 3,4) 
¢(Q) 4 4 4 4 4 4 
#(Q) 10 8 10 12 8 8 


RS-matrices. Now we say a few words about real spectrum matrices (i.e., about 
SL(3,Z)-matrices with three distinct real roots). Mostly we consider the family 
H((0,1|1,0,2)); the situation with the other Hessenberg types is similar. 
Recall that 
Oln+l1 
(m,n) =!10 m 
022n+1 


(1,0,1) 
Ao ii1.02 


This matrix is of Hessenberg type (0,1|1,0,2), and its Hessenberg complexity 
equals 2. Hence H, Hen - (m,n) is ¢-reduced if and only if it is not integer conjugate 
to some matrix of unit Hessenberg complexity. All such matrices are of Hessenberg 
type (0, 1|0,0, 1). 


In Fig. 25.8 we show all matrices H ain ) 2) (m,n) within the square 


—20 < m,n < 20. 


25.7 Open problems 391 


m=—-n—-1 x(-1) =0 


m=n-1 x(1) =0 


Fig. 25.8 The family of matrices of Hessenberg type (0, 1|1,0,2). 


The square in the intersection of the mth column with the nth row corresponds to 


the matrix 7 i a } 2) (m,n). It is colored in black if the characteristic polynomial 
has rational roots. The square is colored in gray if the characteristic polynomial is 


irreducible and there exists an integer vector (x,y,z) with the coordinates satisfying 
—1000 < x,y,z < 1000, 


such that the MD-characteristic of Ho 1\1,9,2)(1,0,1)(mm,n) equals 1 at (x,y,z). All 
the rest of the squares are white. 

If a square is gray, then the corresponding matrix is ¢-nonreduced (see Re- 
mark 25.29). If a square is white, then we cannot conclude whether the matrix is 
¢-reduced (since the integer vector with unit MD-characteristics may have coordi- 
nates with absolute values greater than 1000). 

It is most probable that white squares in Fig. 25.8 represent ¢-reduced matrices. 
We have checked explicitly all white squares for 
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—10<m,n< 10, 


and found that all of them are ¢-reduced (such squares are contained inside the big 
black square shown on the figure). 


We show a boundary broken line between the NRS- and RS-squares in gray. 


Remark. In Fig. 25.8 the NRS-domain is easily visualized. It almost completely 
consists of white squares. On the other hand, the RS-domain contains a relatively 
large number of black squares. This indicates a significant difference between the 
RS and NRS cases. 


Direct calculations of the corresponding MD-characteristics show that the fol- 
lowing proposition holds. 


Proposition 25.60. [fan integer m+n is odd, then H, ep ie (m,n) is ¢-reduced. 


Remark. On the one hand, Proposition 25.60 implies the existence of rays en- 
tirely consisting of ¢-reduced matrices. On the other hand, in contrast to the NRS- 
matrices, there are some rays consisting entirely of ¢-nonreduced RS-matrices. For 
instance, all matrices corresponding to the integer points of the lines 


()m=n; ()m=n+2; (3)m=-—n; 
(4)m=-n-2; (5)n=3m—4, (6)m=3n+6 
are not ¢-reduced (we do note state that the list of such lines is complete). 


So Theorem 25.48 does not have a direct generalization to the RS-case and we 
end up with the following problem. 


Problem 33. What is the percentage of ¢-reduced matrices among matrices of a 
given Hessenberg type Q? 


It is likely that almost all Hessenberg matrices are ¢-reduced (except for some 
measure-zero subset). 


25.8 Exercises 
Exercise 25.1. Let 7 be an integer k-dimensional plane in R”. Describe the locus 


of all integer points a unit integer distance to 7. 


Exercise 25.2. Prove that the Hessenberg matrices 


013 025 
100 and 112 
038 037 


are not integer conjugate but have the same Hessenberg complexity and the same 
characteristic polynomial. 
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Exercise 25.3. Let A € GL(n,R) preserve the level sets of the form x” + y?. Prove 
that there exists @ such that either 


cosa sing —cos@ sind 
A= ( ; ) or A= ( . ) : 
—sind cos a sin@® cosa 
Exercise 25.4. Consider A € GL(n,Z) whose characteristic polynomial is irre- 


ducible over Q. Find the generators of the abelian group 7!(A) in the eigenbasis 
of A. 


Exercise 25.5. Consider A,B € GL(n,Z) whose characteristic polynomial is irre- 
ducible over Q. Then A and B commute if and only if the Dirichlet groups of A and 
B coincide. 


Exercise 25.6. If an integer m-tn is odd, then Hes 2) (m,n) is ¢-reduced. 


Exercise 25.7. Prove that all matrices H, eine 2) (m,n) corresponding to integer 
points of the lines - 


()m=n; (2)m=n+2; (3)m=—n; 
(4)m=—n—-2; (5)n=3m—4, (6)m=3n+6 
are ¢-reduced. Are there some other similar lines in the plane (m,n)? 


Exercise 25.8. How many different nonempty Hessenberg types of complexity n 
exist forn = 1,2,...,10? 

Exercise 25.9. Prove that the minimum of the Markov—Davenport characteristic 
on the corresponding w-sail is attained at some vertex of this w-sail. 


Exercise 25.10. Find an example of an invariant cone for an irreducible operator 
whose w-sails are not homothetic to each other. 


| ® 


Check for 
updates 


Chapter 26 
Approximation of Maximal Commutative 
Subgroups 


We have already discussed some geometric approximation aspects in the plane in 
Chapter 14: we have studied approximations, first, of an arbitrary ray with vertex at 
the origin, and second, of an arrangement of two lines passing through the origin. 
In this chapter we briefly discuss an approximation problem of maximal commu- 
tative subgroups of GL(n,R) by rational subgroups (introduced recently in [109]). 
In general this problem touches the theory of simultaneous approximation and both 
subjects of Chapter 14. The problem of approximation of real spectrum maximal 
commutative subgroups has much in common with the problem of approximation 
of nondegenerate simplicial cones. So it is clear that multidimensional continued 
fractions should be a useful tool here. Also we would like to mention that the ap- 
proximation problem is linked to the so-called limit shape problems (see for in- 
stance [217]). 


In Section 26.1 we give general definitions and formulate the problem of best 
approximations of maximal commutative subgroups. Further, in Section 26.2 we 
discuss the connection of three-dimensional maximal commutative subgroup ap- 
proximation to the classical case of simultaneous approximation of vectors in R?. 

In this chapter we denote \/—1 by J. 


26.1 Rational approximations of MCRS-groups 


In this section we give general definitions and formulate basic concepts of approx- 
imation of maximal commutative subgroups. We begin in Section 26.1.1 with the 
definition of MCRS-groups and say a few words about how they relate to simpli- 
cial cones. Further, in Section 26.1.2 we extend the definition of two-dimensional 
Markov—Davenport forms to the multidimensional case. We define rational sub- 
groups and their “size” in Section 26.1.3. The distance function (discrepancy) be- 
tween two subgroups is introduced in Section 26.1.4. Then in Section 26.1.5 we 
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give the definition of best approximations. Finally, we conclude this section with a 
link to the classical problem of approximating real numbers by rational numbers. 


26.1.1 Maximal commutative subgroups and corresponding 
simplicial cones 


26.1.1.1 Maximal commutative subgroups 


For an arbitrary matrix A of GL(n,R), we denote by Cy (n,R) (A) its centralizer, i-e., 
the set of all the elements of GL(n, RR) commuting with A. 


We say that a matrix is regular if all its eigenvalues are distinct (but not nec- 
essarily real). In this chapter we discuss only the most studied case of centralizers 
defined by regular matrices. Notice that if a matrix A is regular, then the centralizer 
of CL (n,R) (A) is commutative. 


Definition 26.1. Let A be a regular matrix in GL(n,R). We call the centralizer 
CGL(n,R) (A) the maximal commutative subgroup of GL(n,R) (or MCRS-group for 
short). 


We say that a subspace is invariant for a centralizer Cgy (np) (A) if it is invariant 
for every element of Cg1 (n,R) (A). Let a regular operator A have k real and 2/ complex 
eigenvalues (k + 2/ = n). Then the MCRS-group C1 (n,R) (A) has k one-dimensional 
and / two-dimensional minimal invariant subspaces. In case / = 0, we say that the 
MCRS-group is real spectrum. 


Remark 26.2. Notice that if A is a regular matrix in GL(n,Z), then the Dirich- 
let group =(A) is a subgroup of the centralizer Cg] (nz) (A). We would like just to 
mention that Dirichlet groups usually play an important role in the study of best 
approximations of the corresponding MCRS-groups. 


26.1.1.2 The space of simplicial cones 


An n-sided simplicial cone in R” is the convex hull of the union of n unordered rays 
with vertex at the origin. We consider only the cones that are not contained in a 
hyperplane. Denote by Simpl, the space of all n-sided simplicial cones. 

Let us describe a relation between real spectrum maximal commutative sub- 
groups from one side and n-sided simplicial cones from the other. There exists a nat- 
ural 2”-fold covering of the space of all real spectrum MCRS-groups by the space 
Simpl,: the cones map to MCRS-groups whose eigendirections are the extremal 
rays of the cones. Therefore, approximation problems within the space Simpl, can 
be studied in terms of the MCRS-groups. 


Remark 26.3. The space Simpl, is not compact for n > 1. It admits a transitive right 
action of GL(n,R) and possesses an invariant measure U,,. This measure is usually 
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called the Mobius measure, and is similar to the M6bius measure for the space of 
all multidimensional continued fractions discussed in Chapter 23. 


26.1.2 Regular subgroups and Markov—Davenport forms 


Consider an arbitrary MCRS-group 2 in GL(n,R) and denote its eigenlines (both 
real and conjugate) by /,,...,/,. Fori=1,...,n, denote by LZ; a nonzero linear form 
over the space C” that attains zero values at all vectors of the complex lines /; for 
j Zi. Let det(LZi,...,L,) be the determinant of the matrix having in the kth column 
the coefficients of the form L; for k = 1,...,n in the dual basis. 


Definition 26.4. We say that the form 


a GAGs. ‘ i) 


det(L,...,Ln) 


is the Markov—Davenport form for the MCRS-group 2 and denote it by Bg. 


Remark 26.5. Notice that Markov—Davenport forms are closely related to MD- 
characteristics studied in the previous section. Namely, if A € 21 is an operator with 
distinct eigenvalues, then the Markov—Davenport form is proportional to the MD- 
characteristic A,. 


Let us continue with a particular example. 


Example 26.6. Consider the MCRS-group corresponding to the Fibonacci operator 


11 
10)" 
The Fibonacci operator has two eigenlines, 
y=-—Ox and y= 0-'x, 
where @ is the golden ration rs 
bonacci operator is 


. So the Markov—Davenport form of the Fi- 


Ox) (y — O-1x 1 


Notice that we have already studied two-dimensional Markov—Davenport forms 
in Chapter 11 and Chapter 14. Let us formulate and prove the following two simple 
properties of Markov—Davenport forms. 


Proposition 26.7. The Markov—Davenport form of an MCRS-group 2 is defined 
by 2 up to a sign. 
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Proof. All regular operators of 21 have the same set of eigenlines. Hence the forms 
[; are uniquely defined by the MCRS-group up to a multiplication by a scalar and 
permutations. Every scalar multiplication is normalized by the determinant in the 
denominator, while a sign of a permutation is not caught by the denominator. 


Proposition 26.8. All the coefficients of the Markov—Davenport form are either 
simultaneously real, if there is an even number of minimal invariant planes of the 
corresponding MCRS-group, or simultaneously complex. 


Proof. By definition, every MCRS-group contains a real operator with distinct 
eigenvalues. Linear forms related to real eigenvalues have real coefficients. Pairs 
of linear forms corresponding to pairs of complex conjugate eigenvalues are com- 
plex conjugate up to a multiplicative complex factor. Without loss of generality we 
choose this factor to be equal to 1. This implies that the product of two such lin- 
ear forms is real. Due to the determinant in the denominator each pair of complex 
conjugate roots brings an additional multiplicative factor J = /—1. 


Further details of the proof we leave to the reader. 


26.1.3 Rational subgroups and their sizes 


Recall some definitions related to complex geometry. A complex number is called 
a Gaussian integer if it is of the form a+ /b with integers a and b. The set of all 
Gaussian integers forms a lattice with the operation of vector addition. We say that 
a vector is Gaussian if all its coordinates are Gaussian integers. A one-dimensional 
subspace of C” is called Gaussian if it contains a nonzero Gaussian vector. A Gaus- 
sian vector is said to be primitive if all its coordinates are relatively prime with 
respect to the multiplication of Gaussian integers. 


Proposition 26.9. Every Gaussian one-dimensional subspace contains exactly four 
primitive Gaussian vectors. In addition, a vector v is primitive if and only if the 
vectors —v, Iv, and —Iv are primitive as well. 


Notice that in the case of C! there are four primitive vectors (or units): they are 
1 and +/. 


For a complex vector v = (a;4+/b1,...,d,+Ib,), denote by |v| the norm 


It is clear that the minimum of the norm | * | among the Gaussian vectors in a Gauss- 
ian one-dimensional subspace is attained at a primitive Gaussian vector. 


Now we are ready to define rational MCRS-groups. 
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Definition 26.10. An MCRS-group 2 is called rational if every of its eigenlines 
contains a Gaussian nonzero vector. 


Definition 26.11. Consider a rational MCRS-group 2. Let /;,...,/, be the eigen- 
lines of 2l. Let v; be a primitive Gaussian vector in /; fori =1,...,n. The size of 2 
is the real number 


_ mun (Ivil,---5[¥al), 
i=1,....n 


which we denote by v(2). 


26.1.4 Discrepancy functional 


Let us define a natural distance between MCRS-groups. Let 2; and 2l2 be two 
MCRS-groups. Consider the following two symmetric homogeneous forms of de- 
gree n each, the combination of two Davenport forms: 


Dy, + Py, and Dy, — Py,. 


Take two maximal absolute values of the coefficients of these forms (separately). 
The minimum of these two maximal values is considered the distance between 2, 
and 215, which we call the discrepancy and denote by p(2l;, 22). 


Example 26.12. Consider the following two operators 


0 — 


1 with eigenvectors (J,1) and (—/, 1), 


i i) with eigenvectors (1,2) and (1, —2). 
These operators have distinct eigenvalues, and therefore, they define MCRS-groups 
(denote them by 2; and 22 respectively). Both MCRS-groups are rational, where 


the operators of the first one have real eigenvalues, and where the operators of the 
second have pairs of complex conjugate eigenvalues. Direct calculations show that 


v(2i) =1 and V(Qy) =2. 
Now let us calculate the discrepancy p (21, 2l2). We have 


2 2 2 2 
x+y yo —4x 
| Py, (v) + By, (v)| = I 7H q 


and therefore p(2l1,2l2) = eo 
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26.1.5 Approximation model 


In the above notation the problem of approximation of MCRS-groups by rational 
MCRS-groups can be formulated as follows. 


Definition 26.13. (The problem of best approximation.) For a given MCRS- 
group 2 and a positive integer N, find a rational MCRS-group jy of size not ex- 
ceeding N such that for every other rational MCRS-group 2’ of size not exceeding 
N we have 


p (21, Av) < p(t, 2’). 


Remark 26.14. There is another important class of MCRS-groups that would also 
be interesting to consider in this context. An MCRS-group is said to be algebraic if 
it contains regular operators of GL(n, Z). It would be interesting to develop approx- 
imation techniques of MCRS-groups by algebraic MCRS-groups. 


26.1.6 Diophantine approximation and MCRS-group 
approximation 


The classical problem of approximating real numbers by rational numbers is a par- 
ticular case of the problem of best approximations of MCRS-groups in the following 
sense. 

For a nonzero real number a, we denote by 21[a] the MCRS-group of GL(2,R) 


with invariant lines x = 0 and y = ax. Denote by 264 the set of [a] satisfying 


1>a > 0. Denote also by 2X6 1] the subset of 264 defined by rational numbers a. 
For every pair of relatively prime integers (p,q) satisfying 0 < A < 1 we have 


“(a(2))=« 


Let us calculate the discrepancy between two MCRS-groups [0%] and 2l[a]. 
for arbitrary positive real numbers a and Ob, 


x — Ax Xx — Ax 
Py ja.) — Parjon] = lu I zd Y 1 Lis = (Q) — a )x", 
x(y— x) | x(y— Ox) 


i i = 2xy— (2+ M%)x". 


Pa joy| + Pron] = 
Since & > 0 and Q > 0, we have 
P (2A[or1], A[o]) = |or — A]. 


Now the classical problem of approximation of real numbers by rational num- 
bers is reformulated in terms of MCRS-groups as follows: for a given MCRS-group 
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Ala] € 264 and a positive N, find an approximation A[p/q] € 2B of size not 


exceeding N such that for every other MCRS-group 2A[p'/q'| € 285 of size not 
exceeding N, 


p(2[a],2[p/q]) < p (Alo), 2[p'/q']). 


26.2 Simultaneous approximation in R* and MCRS-group 
approximation 


The theory of simultaneous approximation of a real vector by vectors with rational 
coefficients is also to some extent a special case of MCRS-group approximation. 
We say a few words about this similarity in this section. After a brief description 
of the approximation model, we study two particular examples of simultaneous ap- 
proximation in the framework of MCRS-group approximation. 


26.2.1 General construction 


Let (a,b,c) be a vector in R*. Denote by [a,b,c] the maximal commutative sub- 
group defined by an operator with the following three eigenvectors 


(a,b,c), (0,1,2), (0,1,—Z). 


We state the problem of best approximation as follows: for a given MCRS-group 
[a,b,c] and a positive number N, find an approximation Ala’ ,b' ,c’] of size not ex- 
ceeding N such that for every other MCRS-group Ala" ,b" ,c"!| of size not exceeding 
N, we have 


p (2a, b,c], Mla’, b',]) < p (fa, b,c], Ala"",b",c"). 


Notice that 
v(2[a,b,c]) = max(a,b,c) 


and 


Cc b Cc 1 
Pajanc(n2) =1( ai + oxy rai xy? 2). 


The discrepancy between the MCRS-groups 2 [a,b,c] and 2[a',b’,c’] is as follows: 
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b! bete2 pl 4el? 


LD eV) ia b c Cc 
p (Ala, b,c], Ala ,b se)) = min (max ( So alse eel oe 3 5 
b, ob c,¢ R42, b+!” 
max ( a + Qa t a@\?| I lz : 


In the case of positive a, b, c, a’, b’, and c’, we simply have 


bb | | pete? — pl4e? 


a a 


c_e 
a a 


d 


p (a, b,c], Ala’, b’,c']) = max ( 


Remark 26.15. For the case of classical simultaneous approximation one takes a 
slightly different distance between the vectors: 


b_ ob 


a a 


p (2[a, b,c}, Aa’, b’,c’]) = max ( 


ee 
a 


= 


So the described approximation model is not exactly the classical model of simulta- 
neous approximation. 


Let us study two particular examples. 


26.2.2 A ray of a nonreal spectrum operator 


Consider the nonreal spectrum algebraic operator 


O11 
B=|001 
100 


This operator can be thought of as the simplest nonreal spectrum operator from a 
geometric point of view. Denote the real eigenvalue of the operator B by & and its 
complex conjugate eigenvalues by 2 and &3. Notice that 


[E1| > |€2| = [63]. 


Let us approximate the eigenline corresponding to &. Let ve, be the vector in this 
eigenline having the first coordinate equal to 1. In this setting we have 


&| © 1.3247179573 and ve, & (1,0.5698402911,0.7548776662). 


The set of best approximations with size not exceeding 10° contains 48 elements. 
All these elements are of type B”'(1,0,0), where (n;) is the following sequence: 


(4,6,7,8,9, 10,...,50,51,52). 


We conjecture that the set of best approximations coincides with the set of points 
B*(1,0,0), where k = 4 or k > 6; the approximation rate in this case is CN~3/2. The 
structure of the set of best approximations is usually closely related to the corre- 
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sponding Dirichlet group. In this case the Dirichlet group &(B) is homeomorphic to 
Z®Z/2Z; the generators are B and —Id. 


26.2.3 Two-dimensional golden ratio 


Consider now the algebraic real spectrum operator of the two-dimensional golden 


ratio 
321 


G=[|221 
111 


As we have already seen in Chapter 22, this operator has the simplest continued frac- 
tion in the sense of Klein. The Dirichlet group =(G) is generated by the following 
two operators: 


111 01 0 
Hy=[(110] and A={1-1 1 
100 01-1 


Notice that G = He and H> = (H, —Id)~!. The operator Hj here is the three- 
dimensional Fibonacci operator. Denote the eigenvalues of H; by &, 2, and &3 


such that 
[51] > 1&2] > 1&3. 


Let us approximate the eigenline corresponding to €). Denote by ve, the vector 
of this eigenline having the last coordinate equal to 1. Notice that 


&1 © 2.2469796037 and vz, © (2.2469796037, 1.8019377358, 1). 


The set of best approximations of size not exceeding 10° contains 41 elements; 
40 of them (except for the third best approximation) are contained in the set 


{Hi"#3(1,0,0) | m,n€ zh. 
In the following table we show the consecutive pairs of powers (m;,n;) for the these 


41 best approximations except for the third one. So in the column i we get m = mj, 
n =n; for the approximation H;”'H;'(1,0,0). 


i |]1 2 14 |5 6 |7 |8 |9 | 10) 11) 12] 13) 14) 15) 16)17)18)19)20)21)22 


mi\\ 1 3/4 |4|5|5|6)6)6|7 10}10}11}11 
nj 1 jl j2 41424173 )2)3)2)1)3)2)3)/2)4)3)4)3)5)4 


io) 
oa 
Co 
Co 
\o 
\o 
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i |/23}24)25)26|27|28/29)30)3 1)32/33|34)35|36|37|38/39|40)41 


mj||11}12}12/13}13]14)14)15)15)15)16)16)17)17/18)18)19}19}19 
ni || 3}4)3)5)4)5)4)6)5)4)5)4)6)5]6)5)/7/6]5 


The third best approximation is 21[3,2, 1]; it is not of the form Hj"H3(1,0,0). We 
conjecture that the set of all best approximations except 2([3,2,1] is contained in 
the set of all points of type Hj"H3(1,0,0), with the approximation rate in this case 
being CN~3/2, 

We conclude this chapter with the following general remark. 

Remark 26.16. In the above two examples we have several algebraic artifacts (such 
as missing B°(1,0,0) as a best approximation for the first example and an addi- 
tional best approximation 2/3,2, 1] for the second example). This is not a surprise, 


since we do not approximate the triples of eigenvectors simultaneously but a certain 
eigenvector together with two vectors 


(0,1,2),  (0,1,—/). 


We mix vectors of different natures, and as a result we have irregularities. It is 
probable that in most common situations such artifacts may occur infinitely many 
times. 


26.3 Exercises 


Exercise 26.1. (a). Prove that if an operator A is regular, then the centralizer of 
A € GL(n,R) is commutative. 
(b). Describe all matrices whose centralizers are commutative. 


Exercise 26.2. For an arbitrary A € GL(n,R) describe all invariant subspaces of 
CeL(n,R) (A). 


Exercise 26.3. Let G be a commutative subgroup of GL(n,R). Suppose that for 
every matrix A € GL(n,R) \ G there exists a matrix A € G such that A and A do not 
commute. Is it true that there exists a matrix B € G such that G = Cg 1.) (B)? 


Exercise 26.4. Prove that the space Simpl2 is homeomorphic to the M6bius band. 


Exercise 26.5. Find the first 10 best approximations 2[a, b,c] for the minimal and 
the middle eigenvectors of the three-dimensional golden ratio G. 


Exercise 26.6. Using the theory of MCRS-groups formulate a problem of approxi- 
mation of planes in R* by integer planes. 


Check for 
updates | 


Chapter 27 
Other Generalizations of Continued Fractions 


In this chapter we present some other generalizations of regular continued fractions 
to the multidimensional case. The main goal for us here is to give different geomet- 
ric constructions related to such continued fractions (whenever possible). We say 
a few words about Minkowski—Voronoi continued fractions (Section 27.1), trian- 
gle sequences related to Farey addition (Section 27.2), O’Hara’s algorithm related 
to decomposition of rectangular parallelepipeds (Section 27.3), geometric contin- 
ued fractions (Section 27.4), and determinant generalizations of continued fractions 
(Section 27.5). Finally, in Section 27.6 we describe the relation of regular continued 
fractions to rational knots and links. 

We do not pretend to give a complete list of generalizations of continued frac- 
tions. The idea is to show the diversity of generalizations. Let us give several refer- 
ences to some subjects that are beyond the scope of this book but that may be inter- 
esting and useful in the framework of geometry of continued fractions: p-adic con- 
tinued fractions [144], [187], [87], [81], complex continued fractions [189], Rosen 
continued fractions and their expansions [186], [131], geodesic continued fraction 
approach based on Minkowski reduction [139]. Finally, we would like to mention a 
relation of continued fractions to tiling of a square by rectangles [137], [64], [183]. 


27.1 Relative minima 


In this section we consider a geometric generalization of continued fractions in 
terms of local minima. This generalization appears for the first time in the works 
of H. Minkowski [151] and G.F. Voronoi [219] (see also in [220]) and is used to 
study units in algebraic fields. Several properties of local minima were later studied 
by G. Bullig in [34]. The main object of Minkowski—Voronoi continued fractions 
is an arbitrary complete lattice in R? with respect to the fixed coordinate lat- 
tice. Statistical properties of relative minima were studied by A.A. Illarionov in [89] 
and [90]. See also [37] and [78] for more information about three-dimensional rela- 
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tive minima. A description of the multidimensional case can be found in paper [36] 
by V.A. Bykovskii. 

The main idea of the Minkowski approach is to define extremal nodes of a lattice 
IT" with respect to a family of convex sets that can be chosen with certain freedom 
(for instance, vertices of sails considered in previous chapters of this book can be 
considered tetrahedral local minima). In this section we describe the classical ap- 
proach that works with coordinate parallelepipeds. 

For simplicity, in this section we consider only sets in R? in general position (in 
the sense that no two vertices of the lattice are in a plane parallel to some coordinate 
plane). 


27.1.1 Relative minima and the Minkowski—Voronoi complex 


We begin with a rather algebraic definition of the Minkowski— Voronoi complex. In 
later subsections, we will construct a tessellation of the plane showing the geometric 
nature of the complex. 

Let T be an arbitrary subset of R) = (Rso)”. For i=1,...,n we set 


T; = max{x; | (x1,---,%n) € T}. 
We associate to T the following parallelepiped: 
TI(T) = {(x1,---,%n) |O< a7 < T,,8=1,...,n}. 


Definition 27.1. Let S be an arbitrary subset of R49. An element s € S is called a 
Voronoi relative minimum with respect to S if the parallelepiped IT({s}) contains no 
points of S except for the origin. 


Definition 27.2. Let S be an arbitrary subset of R%_ and let S be the set of all its 


Voronoi relative minima. A finite subset T C S is called minimal if the parallelepiped 
TI(T) contains no points of $\T except for the origin. We denote the set of all 
minimal k-element subsets of $ C S by I2,(S). 


It is clear that any minimal subset of a minimal subset is also minimal. 


Definition 27.3. A Minkowski—Voronoi complex MV (S) is an (n — 1)-dimensional 
complex such that 

(i) the k-dimensional faces of MV(S) are enumerated by its minimal (n—k)- 
element subsets (i.e., by the elements of N,,_,(S)); 

(ii) a face with a minimal subset 7; is adjacent to a face with a minimal subset 
Tp # T, if and only if T; C 1. 


Remark 27.4. In the three-dimensional case it is also common to consider the 
Voronoi and Minkowski graphs that are subcomplexes of the Minkowski— Voronoi 
complex. They are defined as follows. 
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The Voronoi graph is the graph whose vertices and edges are respectively vertices 
and edges of the Minkowski— Voronoi complex. 

The Minkowski graph is the graph whose vertices are edges are respectively faces 
and edges of the Minkowski—Voronoi complex (two vertices in the Minkowski 
graph are connected by an edge if and only if the corresponding faces in the 
Minkowski—Voronoi complex have a common edge. 


So in some sense Minkowski and Voronoi graphs are dual to each other. 
Example 27.5. Let us consider an example of a 6-element set Sy C IR? defined as 
follows 
So = {81,82,53,84,55,86}, 

where 

sy = (3,0,0), $2 = (0,3,0), $3 = (0,0,3), 

54 = (2, 1,2), S5= fais 1), 56 = (2,3,4). 
There are only five Voronoi relative minima for the set Sp, namely the vectors 


S},---,55. The Minkowski—Voronoi complex contains 5 vertices, 6 edges, and 5 
faces. Its vertices are 


Vy =151,53,84}, vo ={53,54,55},. V3 = {51,5455}, 
v4 = {52,53,55}, v5 = {51,52,55}. 


Its edges are 
= {51,53}, eo = {53,50}, e3 = {51,82}, 
e4 = {53,54}, e5 = {51,84}, e6 = {54,55}, 
e7 = {53,55}, eg = {51,85}, eo = {52,55}. 


Its faces are 


fi ={s1}, fr = {2}, fa = {53}, fa = {sa}, fs = {ss}. 


Finally, we describe the complex MV(S) as a tessellation of an open two-dimensi- 
onal disk. We show vertices (on the left), edges (in the middle), and faces (on the 


right) separately: 
£1 ¢ Le, 
€5 va 


WV 
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27.1.2 Minkowski—Voronoi tessellations of the plane 


In this subsection we discuss some geometric images standing behind the notion of 
the Minkowski—Voronoi complex in the three-dimensional case. 


Definition 27.6. Let S be an arbitrary subset of R3o. The Minkowski polyhedron for 
S is the boundary of the set 7 


S@RXq9 = {str|s €S,re R3o}. 


In other words, the Minkowski polyhedron is the boundary of the union of copies 
of the positive octant shifted by vertices of the set S. 


Definition 27.7. The Minkowski—Voronoi tessellation for a set S C R3, is a tessel- 
lation of the plane x + y+ z = 0 obtained by the following three steps. 

Step 1. Consider the Minkowski polyhedron for the set S and project it orthogo- 
nally to the plane x+y +z=0. This projection induces a tessellation of the plane 
by edges of the Minkowski polyhedron. 

Step 2. Remove from the tessellation of Step 1 all vertices corresponding to local 
minima of the function x + y+ z on the Minkowski polyhedron (this is exactly the 
relative minima of S). Remove also all edges coming from all the removed vertices. 

Step 3. After Step 2 some of the vertices are of valence 1. For each vertex v of 
valence | and the only remaining edge wv with endpoint at v we replace the edge 
wv by the ray wv with vertex at w and passing through v. 


Proposition 27.8. The Minkowski—Voronoi tessellation for a nice set S has the com- 
binatorial structure of the Minkowski—Voronoi complex MV (S). 


Remark 27.9. Here we do not specify what the word “nice” means. We say only 
that it includes finite sets and complete lattices considered below. 


Example 27.10. Consider the set So as in Example 27.5. In Fig. 27.1 we show the 
Minkowski polyhedron (on the left) and the corresponding Minkowski— Voronoi 
tessellation (on the right). The local minima of the function x+y-+z on the 
Minkowski polyhedron for So are the relative minima /),..., f5. They identify the 
faces of the complex MV(So). The local maxima of the function x + y +z on the 
Minkowski polyhedron for So are v1,...,v5, corresponding to vertices of the com- 
plex MV (So). The vertices v,,...,v5 are as follows: 


vy = (3,1,3), v2 = (2,2,3), v3 = (3,2,2), 
v4 = (1,3,3), vs = (3,3,1). 
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rt+tyt+z=0 


h | 


Fig. 27.1 The Minkowski polyhedron (on the left) and the corresponding Minkowski—Voronoi 
tessellation (on the right). 


27.1.3 Minkowski—Voronoi continued fractions in R° 


Now let us give a definition of Minkowski—Voronoi continued fractions for com- 
plete lattices in R*. Consider a lattice I’ defined as follows: 


T= {mv +mgv2 +m3Vv3 :m1,m2,m3 € Zh, 
where v1, v2, v3 are linearly independent vectors in R°?. Define 


IP] = ((al bl zl) | @9,z) € PF \ {(0,0,0)}- 


Definition 27.11. We say that an edge {51,52} of the complex MV (|I"|) with vertices 
{51,52,53} and {s1,52,s4} is nondegenerate if at least one of the triples (s1, 52,53) 
and (s1,52,54) generates the lattice I’. 

Suppose that {s1, 52,53} generates the lattice. Then we equip the edge {51,52} with 
a matrix that sends s; and s2 to themselves and takes 53 to s4. 

So if both triples (s1,52,53) and (s1,s2,54) generate the lattice I’, then the corre- 
sponding edge is equipped with a pair of matrices inverse to each other. 


Finally, we have all the tools that are needed to give a definition of the multidi- 
mensional Minkowski—Voronoi continued fraction. 


Definition 27.12. Let I be an arbitrary complete lattice in R*. The Minkowski— 
Voronoi continued fraction for I" is the Minkowski—Voronoi complex MV (|I'|) all 
of whose nondegenerate edges are equipped. 


We refer to [151] and [82] for more details related to general minimal systems 
and to the papers [78] and [216] concerning the arbitrary case. 
Let us give a small remark on the periodic algebraic case. 
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Remark 27.13. Let a, B, and ybe three roots of a cubic polynomial x* +-ax? +bx+1 
with integer coefficients a and b. Consider the lattice generated by vectors 


(1,14), (a,B,Y), and (eBay): 


From Dirichlet’s unit theorem it follows that the multiplicative subgroup of diagonal 
matrices with positive diagonal elements preserving the lattice is isomorphic to Z7. 
The diagonal matrices of this group preserve both the Minkowski— Voronoi com- 
plex and all translation matrices for the corresponding equipped edges. Therefore, 
the Minkowski—Voronoi continued fraction in this case is doubly periodic. 
Remark 27.14. Let us say a few words about the two-dimensional case. Consider a 
complete lattice 7 C R?. The Minkowski—Voronoi complex for I is a broken line. 
Each edge e; is equipped with a matrix 


0 +1 
1 aj : 
Consider now sails of lattices in each of the four coordinate octants for the lattice I" 


(i.e., we consider the boundaries of the convex hulls for points of I” but not of Z). 
Then the integers a; correspond to the integer lengths of these sails. 


27.1.4 Combinatorial properties of the Minkowski—Voronoi 
tessellation for integer sublattices 


If I is a full-rank sublattice of Z”, then the Minkowski—Voronoi complex MV (|I"|) 
contains finitely many faces. Not much is known about its combinatorial structure. 
The first open problem here is as follows. 


Problem 34. Give a combinatorial description (a complete invariant) of all realizable 
finite Minkowski— Voronoi complexes for integer sublattices. 


In particular, it is interesting to consider the following problem. 


Problem 35. How many different N-vertex Minkowski—Voronoi complexes for in- 
teger lattices do there exist for a fixed NV? 


Notice that the problem of a combinatorial description of the Minkowski— 
Voronoi complex is also open in the case of infinite complexes as well. Nevertheless 
it seems almost impossible to have a nice complete invariant for it, since there is 
no testing tool to check whether an infinite complex is realizable as MV(|I"|) for 
some lattice I’. As in the case of sails in multidimensional continued fractions, the 
only infinite case in which it is possible to calculate some examples is the case of 
periodic Minkowski— Voronoi complexes corresponding to algebraic lattices (see 
Remark 27.13 above). So for the infinite case, we propose a simpler version of 
Problem 34. 
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Fig. 27.2, The Minkowski—Voronoi tessellation for |I71|. 


Problem 36. Find criteria of nonrealizability of the Minkowski—Voronoi com- 
plexes for arbitrary lattices. 


In [225] C. Yannopoulos gives several examples of representations using only 
three directions for the edges of the Voronoi graph in the plane (which is the 1I- 
skeleton of the Minkowski—Voronoi complex). In a paper [108] the authors extend 
such representations to the general case of integer sublattices. We announce this 
result here. 


Theorem 27.15. Let I’ be a sublattice of Z". The Minkowski—Voronoi tessellation 
can be chosen in such a way that the following conditions hold: 
—AIll finite edges are straight segments of one of the directions Mn (fork €Z). 
— Each vertex that does not contain infinite edges is a vertex of the following 8 


types: 


DYDD ® 


Example 27.16. Consider the lattice I,7; generated by the following three vectors: 


(171,0,0), (0,171,0), and (2,32,—1). 


Notice that the first integer point on the z-axis is (0,0,171). The diagram of 
the Minkowski—Voronoi tessellation for |I71| satisfying the conditions of Theo- 
rem 27.15 is shown in Fig. 27.2. 
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27.2 Farey addition, Farey tessellation, triangle sequences 
27.2.1 Farey addition of rational numbers 


We begin with a definition of Farey addition of rational numbers. 
Definition 27.17. Let a 1 and a be two rational numbers. We assume that p; is 
relatively prime to qg; and pz is relatively prime to gz. The rational number 


Pl .P2 _ PivP2 
Mm 4 "irq 


is called the Farey sum of and re (Note that for every integer m we consider the 
fraction +.) 

There is a nice way to construct the set of rational numbers using the operation 
of Farey addition. We do it inductively in countably many steps. 


Construction of the rational numbers. 


Base of construction. We start with the sequence of integer numbers (ai.1), where 
ai) =iforie Z. 
The kth step of the construction. Suppose that we have constructed a sequence 
(ai) (which is infinite on both sides). Let us extend this sequence to the sequence 
(aix41)- Set 

iki =4iz, forie Z; 

A+ k+l =U pOGi4i4, forie Z. 
In other words, we put between every two numbers of the sequence (a; ,.) their Farey 
sum. 


Example 27.18. The sequences for the first three steps are as follows: 


Base: ..-»—4,-3, —2,-1,0,1,2,3,4,... 
1.1 
Step 1: weey—2y a 1, ,0, iB 
Ag aes a ey 
Step 2: wey, 0 i eee 


Remark 27.19. The part of the sequence (a;,) contained between the elements 0 
and | is sometimes called the kth Farey sequence. 

Remark 27.20. The denominators appearing in the Farey sequences form Stern’s 
diatomic sequence, which is defined as follows: 


ay = 1; 
aj,=a;, foric Z,; 
agi41=ajt+ai41, forieZ,. 
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The first few elements of this sequence are 
1,1,2,1,3,2,3,1,4,3,5,2,5,3,4,1,... 


Further properties of Stern’s diatomic sequence can be found in a recent sur- 
vey [161] by S. Northshield. In this relation we would like to mention the work [123] 
by A. Knauf on ferromagnetic spin chains, where this sequence appears as Pascal’s 
triangle with memory (see also [121] and [122]). 


27.2.2 Farey tessellation 


The above construction of rational numbers appears in hyperbolic geometry as Farey 
tessellation. In this section we work with the model of the hyperbolic plane in the 
upper half-plane of the real plane R* with coordinates (x,y). 

We add the point at infinity to the line y = 0 and call it the absolute. Considering 
the coordinate x as the coordinate on the absolute, we say that the coordinate of 
infinity is co. For every pair of points (a,b) in the absolute there exists a unique 
hyperbolic line passing through these points, denoted by /(a,b). For every line / in 
the hyperbolic plane there exists a unique hyperbolic reflection with the axis /. A 
triangle all three of whose vertices are on the absolute is called an ideal hyperbolic 
triangle. 


Definition 27.21. The Farey tessellation is the minimal possible decomposition of 
the hyperbolic plane into ideal triangles such that the following two conditions hold: 
— it contains the ideal triangle with vertices 0, 1, and ©; 
— the tessellation is preserved by the group of isometries of the hyperbolic plane 
generated by all reflections whose axes are sides of ideal triangles in the tessellation. 


There is a simple way to construct this tessellation. We start with the ideal trian- 
gle with vertices 0, 1, and o. Reflecting this triangle with respect to the axes /(0, 1), 
1(0,0), and /(1,°°), we get three new triangles with vertex sets {1,2,00}, {—1,0, >}, 
and {0, 5 1} respectively. Add all of them to the tessellation. Continue iteratively to 
reflect the obtained picture with respect to the edges of new triangles. In countably 
many steps we construct all ideal triangles of the tessellation. We show the Farey 


tessellation in Fig. 27.3. 


Proposition 27.22. (i) The set of vertices of all ideal triangles in the Farey tessel- 
lation coincides with the subset of the points on the absolute whose coordinates are 
rational or ©, 

(ii) An ideal triangle with vertices p <q <r is in the Farey tessellation if and 
only if there exist i and k such that p = ajz, r = dizi, and q= pr (ie, 
q= Q2i+1,k41): 


The continued fractions for the vertices of ideal triangles of the Farey tessellation 
has the following surprising regularity. 
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Fig. 27.3 Farey tessellation in the hyperbolic plane. 


Proposition 27.23. For every ideal triangle T in the Farey tessellation there exists a 
sequence of integers ay,a\,...,4n+41 such that the vertices of T have the coordinates 
D, g, and p®q, where 


p=(0;ao:a,:-++:an], g=[Os;d0: a1: +++ dn: an4i], 
and p@®q=(0;a9: a1 +++! dn! dn41 +1). 


27.2.3 Descent toward the absolute 


It turns out that the continued fraction of a real number itself can be interpreted in 
terms of the Farey tessellation. Let us explain this. 


It is clear that the Farey tessellation is invariant with respect to the shift on the 
vector (1,0), so for simplicity we restrict ourselves to the case of real numbers a 
satisfying O<a@< 1. 

In the upper half-plane model one can imagine the Farey tessellation as a pyra- 
mid. We say one more time that we consider now only the part of the tessellation 
contained in the band 0 < x < 1. On the top of the pyramid there is the ideal triangle 
with vertices 0, 1, and oo. Suppose that we are at this triangle. It is permitted either 
to exit the pyramid at vertex | or to descend to the adjacent triangle with vertices 0, 
5 and 1. To fix notation, we say that we then descend left. 

Suppose now that we are at some ideal triangle of the tessellation with vertices 
xX) <x2 <.x3. Then it is permitted either to exit the pyramid at vertex x2 or to descend 
to one of the neighboring triangles in the tessellation: either to descend left to the 
triangle with vertices xj < x4 < x2 or to descend right to the triangle with vertices 
x2 <x5 < x3 for the appropriate point x4 or x5 respectively (see Fig. 27.4). 


In a finite or infinite number of steps we descend to some point @ of the absolute. 
At each step of the descent we go either to the right or to the left, so it is natural to 
define the descent sequence whose elements are letters L (if we descend to the left) 
and R (if we descend to the right). If the sequence is nonempty, the first is always L. 
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Fig. 27.4 Descent in the Farey pyramid. The descent sequence is LLRR, and hence the exit point 
3 
is 5 = [0;2:3]. 


Let us denote by a, the number of letters L standing before the first letter R. 
Denote by a2 the number of letters R standing before the next letter L, etc. 


Example 27.24. For instance, for the sequence (L,L,L,R,R,L,L,L,L,R,R,L,L) we 
get 
a\ =3; an =2; a3 =4; a4 =2; as =2. 


We have the following interesting theorem. 


Theorem 27.25. In the above notation the traveler will either exit the pyramid at the 
rational number 
(0; ay 22 2+++ An—1 2 Ant] 


in a finite number of steps or descend to the irrational number 


(03a, :az:--°] 


on the absolute. 


For instance, in Example 27.24 we exit at point & = [0;3:2:4: 2:3] in finitely 
many steps. In the example considered in Fig. 27.4 we have the descent sequence 
LLRR and the exit point 3 = [0;2 : 3]. 


Currently it is not clear what the analogue of the Farey tessellation in multidi- 
mensional hyperbolic geometry is, so we conclude this section with the following 
general open problem. 
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Problem 37. Find a natural generalization of the Farey tessellation to higher- 
dimensional hyperbolic geometry. 


In this regard we would like to mention the works [158], [157], [159], [160] by 
V.V. Nikulin on discrete reflection groups in hyperbolic spaces. 


Remark 27.26. It is also interesting to consider Farey tessellation in the de Sit- 
ter geometry, which is in some sense dual to the projective geometry. This is a 
new geometric approach to the classic theory of quadratic forms, developed by 
V.I. Arnold [7] and F. Aicardi [1] and [2]). 


27.2.4 Triangle sequences 


In this subsection we briefly observe several results related to triangle sequences. 
Triangle sequences generalize classical Farey addition to the two-dimensional case. 
They were introduced by T. Garrity in [67]. For multidimensional triangle sequences 
we refer to [51]. 


27.2.4.1 Definition of triangle sequence and its algebraic properties 


We work with the triangle 
A={(x,y)|12x2y> 0}. 


Consider the following partition of the triangle A into smaller triangles A, Ao,..., 
where 
Ax = {(x,y) € A | l—x—-ky > 0 > 1—x—(k-+1)y > 0} 


for k € Z,. Now let us define the map T that maps each triangle A,, to the triangle 
A. For every point (x,y) of the triangle A, define 


y |—x—ky 
T (x,y) = (=,——). 


The point T (x,y) is in the closure A = AUOA. 


Definition 27.27. Consider an arbitrary point (x,y) of the triangle A. Let T*(x,y) € 
Ayn. Then we put a, =n, fork = 1,2,... . The sequence 


(do,a1,42,...) 


is called the triangle sequence for the pair (x,y). 


Note that the triangle sequence is said to terminate at step k if T*(x,y) ¢ A\ A. 
In most cases the triangle sequence is infinite. 
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Remark 27.28. In [150] the authors proved that the triangle map T : A > A is 
ergodic with respect to the Lebesgue measure. 


27.2.4.2 Multidimensional Farey addition 


The definition of Farey addition is straightforward. 


Definition 27.29. Consider two vectors in R” whose coordinates are represented as 
ratios of numbers: 


— (PL Pn ny eae In 
T= (4,...,24) and s=(4,...,%), 


Then the Farey sum of T and S is the rational vector 


TOS= jas but). 


qM+81 °°"? Gnt+Sn 


We now use the following notation. Let T = (p1,..., py) be an integer vector in 
IR”. We denote by X the following representation of a rational vector in R"~!: 


Fa ( 22,2... Pn), 
Pi? Pi? "a 


Finally, we put by definition 
TiS=TOS. 


27.2.4.3 Explicit formula for vertices of triangles 


The map T defines a natural nested partition of the triangle A into smaller triangles. 
This partition contains triangles of the following type. 


Definition 27.30. Let (ao,...,dn) be a sequence of nonnegative integers. Define 
A(ao,--+;4n) = {(x,y) | T*(x,y) € Ag, for all k <n}. 


Our next goal is to write an explicit formula for such triangles. We begin a brief 
geometric description of triangle sequences. One can consider the triangle sequence 
for (a, B) as a method to construct a sequence of integer vectors (C;,,) whose ele- 
ments are “almost orthogonal” to the vector (1,a,B), ie., the dot product of the 
vectors in the sequence with the point (1,0, B) in R* is small. Set 


1 0 0 
C3=|0], Co=l1], ci=]o 
0 0 1 


Suppose that the triangle sequence for (a, B) is (a;). Then we set by definition 
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Cy = Cy_3 — Cy_-2 — C1 - 


Define also 
Xe = Cy X Ceti, 
where v x w denotes the cross product of two vectors v and w in R?. 


Theorem 27.31. The vertices of the triangle A(ao,...,@y) are the points Ney, Rep 
and X,4+Xy_2. 


As is usual for continued fraction algorithms, the proof is obtained by induction 
(see the proof in [15]). 


27.2.4.4 Convergence of triangle sequences 


A triangle sequence does not always converge to a unique number. However, it does 
so under certain conditions. 

Consider an infinite triangle sequence (ao,a1,a2,...) of nonzero elements. Let 
the triangle A(ag,...,a,) have vertices X,,_, X,, and X,+X,,_2. Set 


d(X,—1,Xn41) 
d(Xn-1 Xn 4+Xn_2) : 


= 


where d(Yj, Y2) is the Euclidean distance between the points Y; and Y. 
We have the following theorem. 


Theorem 27.32. Assume that (ay) is a triangle sequence of nonzero elements. Then 
the triangle sequence describes a unique pair (a, B) when 


[[(—4n) =0. 
N 


n=. 


Let us now give a criterion of nonconvergence of a triangle sequence. 


Theorem 27.33. Consider a sequence (ay). Suppose that there exists N such that 
an > 0 for any n > N and 


© 


[[G-4:) > 0. 


n=N 


Then the triangle sequence (ay) does not correspond to a unique point. 


We refer to [15] for the proof of the above two theorems. Weak convergence is 
discussed in [150]. 
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27.2.4.5 Algebraic periodicity 


One of the most challenging properties that one thinks of while generalizing con- 
tinued fractions to the multidimensional case is algebraic periodicity. If a triangle 
sequence is periodic, then the corresponding pair of real numbers correspond to cer- 
tain solutions of a cubic equation with integer coefficients. Although the converse 
statement is not true, there is the following nice result announced in [50], which 
seems to improve the situation. 

For simplicity we embed the triangle A in R*, simply by adding the first coordi- 
nate 1 and denoting the resulting triangle by A. The vertices of A are 


1 1 1 
y={/0)}, w= [1], w= {tl 
0 0 1 
Now we define three important matrices: 
001 101 111 
Ap={100], Ar={0O10], B= {011 
O11 001 001 


We embed the permutation group on three elements into GL(3, Z) as a subgroup 
of matrices permuting the columns. 


Definition 27.34. For an arbitrary triple of column permutations (0, 1,7) we de- 
fine the following two matrices: 


Fo.o,% =OA0T and Fi.e¢.7, = OAT. 
Denote by A;(0, 7%, 71) the image of A under the map ine Fo,6,1: 


We use M’ to denote the transpose matrix to M. 


Definition 27.35. Let us define the map Tg.,z,,c, for an arbitrary triple of column per- 
mutations 0,7), 7. Consider a pair (x,y) € A. Suppose that (x,y) € Ax (6, T, 71). 
Then we define 

(a,b,c) = (1,x,y)- (BF g, oe k Bo ae 


1,0,7 


We define To ,1,1, (x,y) as follows: 


be 
To,19,11 (x,y) > (-. “). 


Remark 27.36. We have actually defined 216 different maps Gonesponduls to dif- 
ferent triples of permutations. Notice that the matrix (BF ¢ meet . 1B '\! is in 


SL(3,Z). The triangle partition maps To,z),1, are called TRIP maps. 
Finally, we describe the following three classes of maps, where n € Zso: 


1 n, 
Class 1. Te 66° (Te493) 123) 2 


420 27 Other Generalizations of Continued Fractions 


n 
Class 2. Te (23),¢° (Tei (123),123)) ; 


Class 3. T(93),(23),¢ © (7113), (12,6) * 

Notice that for each choice of a class and a parameter n we get a different 
algorithm defined by the composition 7 of the corresponding n+1 TRIP maps. 
Similar to Definition 27.35, the composed map 7 sends the triangle A to the set 
A (my ymp,...,mp,me)> Which is encoded by n+1 integer parameters appearing in the com- 


position if: 

Definition 27.37. Let (x,y) € A. Fix a composed map 7 of one of Classes 1, 2, 
3 with a fixed parameter n. For an arbitrary integer k we denote by a the (n+1)- 
dimensional vector of nonnegative integers such that Tee, y) isin Ag,. We say that 


(a;) is the triangle sequence for (x,y) with respect to T . 


Now we are ready to formulate an interesting theorem announced in [50] in 
which the authors in some sense cover all cubic irrationalities by periodic triangle 
sequences of several continued fraction algorithms in the following way. 


Theorem 27.38. Let K be a cubic extension of Q. Suppose that the element u € 
Ox satisfies 0 <u <1. Then either (u,u’), (u?,u*), (u,u? —u), (u?,u? — ut), 
(uu, (uu)? —uu'), or ((uu!)?, (uu')? — (uu')*), has a periodic triangle sequence un- 
der some composed map T in Class 1, 2, or 3 (where u' is one of the conjugates of 


u). 


The following conjecture is given for the multidimensional case. 


Conjecture 38. (T. Garrity.) For each positive integer d, there is a finite num- 
ber of multidimensional continued fraction algorithms, so that for every d-tuple 
(E1,---,6a) with Q(&,,...,&7) a degree-d algebraic number field, there is a multi- 
dimensional continued fraction algorithm in the family spanned by the initial algo- 
rithms such that it generates a periodic output. 


Remark 27.39. Farey fractions give rise to a thermodynamic approach to real num- 
bers (for more information we refer to [121], [62], [122], and [68]). Triangle se- 
quences are used to generalize this approach to the case of pairs of real numbers 
[69]. 

Remark 27.40. There are several other generalizations in the spirit of triangle se- 
quences. For instance, the classical Minkowski ?(x) function (see [152], [82], [212], 
[218], etc.) was generalized to the two-dimensional case by O.R. Beaver and T. Gar- 
rity in [21] (see also [171)). 


27.3 Decompositions of coordinate rectangular bricks and 
O’Hara’s algorithm 


Surprisingly, the Euclidean algorithm arises in the theory of partitions related to 
O’Hara’s algorithms, introduced by K.M. O’Hara in [164] (see also [165]). In this 
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section we briefly describe the geometric approach to O’ Hara’s algorithm via special 
decompositions of parallelepipeds. In particular we describe their relation to regular 
continued fractions and their generalizations. For simplicity we consider here only 
the finite-dimensional case. For the general infinite-dimensional case for permuta- 
tions of Z, and further information on the finite-dimensional case, we refer to [125] 
and [169]. 


27.3.1 I1-congruence of coordinate rectangular bricks 


We begin with several general definitions. If a polyhedron P is a disjoint union 
of a finite or countable number of convex polyhedra P),P>2,..., then we say that 
{P,P2,...} is a decomposition of P. 


Definition 27.41. Let 2 be a linear subspace of codimension one in R”. We say 
that two convex polyhedra P and Q in R” are 2-congruent if there exist decompo- 
sitions {P,...,P,} and {Qi,...,Q,} of P and Q respectively such that for every 
i€ {1,...,k} there exists a translation 7; along a vector in 7 taking P,; to Q; (i.e., 
T;(P;) = Qi). 

Two convex polyhedra are said to be II-congruent if there exists a space 7 such 
that these polyhedra are 7-congruent. 


Definition 27.42. If instead of finite decompositions of polyhedra we have count- 
able decompositions, then the corresponding polyhedra are called asymptotically 
m-congruent (respectively asymptotically II-congruent). 


In this section we are interested only in the coordinate parallelepipeds defined as 
R(a1,.--,;4n) = [0,a,) x --- x [0,an). 


We call them coordinate bricks. 


Example 27.43. Let us consider two bricks R(6,11) and R(22,3). We show their 
decompositions in Fig. 27.5 simultaneously. Each of the bricks is subdivided into 
six smaller rectangles labeled by Roman numerals. The rectangle with label n is 
shifted to the rectangle with label n’. For instance, the gray rectangle labeled by V is 
taken to the gray rectangle labeled by V’. Here all the translation vectors are in the 
linear space x + 2y = 0. 


In the next two subsections we answer the following two natural questions: 
(a) Which coordinate bricks are (asymptotically) TI-congruent? 
(b) How does one construct the decompositions and translations showing the 
(asymptotic) IT-congruence of such boxes? 
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Fig. 27.5 A decomposition of bricks R(6, 11) and R(22,3). 


27.3.2 Criteron of II-congruence of coordinate bricks 


We begin with the following definition. 


Definition 27.44. Two coordinate bricks P = R(a1,...,a,) and Q = R(by,...,bn) 
are called relatively rational if there exists A 4 0 such that AP and AQ are integer 
regular parallelepipeds. 


In an arbitrary dimension n the following general theorem holds. 


Theorem 27.45. (i) Two coordinate bricks of the same dimension are asymptotically 
II-congruent if and only if they have the same volume. 

(ii) Two relatively rational rectangular coordinate boxes are TI-congruent if and 
only if they have the same volume. 


In addition we have the following simple formula to calculate the equation of the 
plane z. 


Proposition 27.46. Let R(a\,...,dn) and R(bj,...,bn) be m-congruent. Then the 
hyperspace 7 is uniquely defined by the equation 
a a\a2 a,...dn-1 


1 
! boutd =. 
A ppg oor eee pa 


For a proof of Theorem 27.45 we refer to [125]. 
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27.3.3 Geometric version of O’Hara’s algorithm for partitions 


The answer to the second question (on construction of decompositions and trans- 
lations) is given by O’Hara’s algorithm, which comes from the theory of general 
partitions (we refer the interested reader to [5], [165], [164], [168], [169]). In this 
subsection we consider only the geometric representation of the algorithm related 
to continued fractions. In some sense the idea of this algorithm is an algorithm- 
definition, giving a constructive description of all the translations 7;: for an arbitrary 
point of the first coordinate brick the algorithm constructs the image of this point in 
the second coordinate brick. The resulting decomposition is a decomposition into 
boxes, all of whose edges are parallel to coordinate vectors. 


O’Hara’s algorithm in a geometric form. 


Predefined parameters of the algorithm. We are given two coordinate bricks 
R(a, Sits ,Qn) and R(by,. we ,bn). 


Input data. A point v € R(a,...,dn). 
Aim of the algorithm. To construct the point w € R(b,,...,b,) corresponding to v. 


Step 1. Put vy =v. 


Recursive step k. We have constructed a point vz_ 1. If vg_, is contained in the brick 
R(by,..-,bn), then the algorithm terminates and we put w = v,_1. Suppose now that 
Vg_1 is not in the brick R(b1,...,b,). Then there exists an index j such that the jth 
coordinate of vz_, is not less than b;, allowing us to recursively define 


Vk = Vk~1 —bjej +aj-1ej-1 
(here e; and e;_, are the corresponding coordinate vectors whose indices are in the 
group Z/nZ). 


Output. The algorithm returns a vector w € R(bj,..., bn). 


Remark 27.47. It turns out that the result w does not depend on the choice of the co- 
ordinate j in any recursive step, but entirely on the input data v (see [164] and [165] 
for more details). 


Now we are ready to define a mapping between the coordinate bricks. 


Definition 27.48. Let P and Q be two coordinate bricks of the same volume in R”. 
We define a bijection @pg : P + Q at every v € P as follows: 


9po(v) =, 


where w is the output of the algorithm with the input data v. 
Observe the following property of @pg. 


Proposition 27.49. The function @pg is a piecewise linear bijective function be- 
tween the coordinate bricks P and Q. 
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Due to piecewise linearity we have the following natural decomposition of the 
coordinate bricks. 


Definition 27.50. Let P and Q be two coordinate bricks of the same volume in 
IR”. Consider a natural decomposition of the coordinate brick P into regions on 
which the function @pg is linear. Define a decomposition of Q as the image of the 
decomposition of P via the map @pg. We say that these decompositions of P and Q 
are associated to the bijection pg. 


In the general case the described decompositions have a countable number of 
boxes. However, when P and @Q are relatively rational, the decompositions of P and 
Q are finite. 


The following proposition gives an answer to the second question of this sec- 
tion: how does one construct the decompositions and translations showing the 
(asymptotic) IT-congruence of such boxes? 


Proposition 27.51. Let P = R(aj,...,d,) and Q = R(aq,...,dn) be asymptotically 
II-congruent coordinate bricks. Then P and Q are asymptotically ™-congruent, 
where the hyperplane 7 is defined by the equation 


qe 4 21a " seeds 
XxX —X —— XxX eee a ’ 
by" bobs by...Dn 

In addition, the decompositions of P and Q associated to the bijection @pg estab- 
lish the asymptotic ™-congruence. The restriction of @pg to parallelepipeds in the 
decomposition of P identifies all the translation vectors. 


Here we notice again that in the case of relatively rational coordinate bricks P 
and Q, one has z-congruence instead of asymptotic z-congruence. 


Remark 27.52. In the special case of rectangular bricks R(a,b) and R(b,a) with 
relatively prime positive integers a and b and the space x = {x + y = 0}, O’Hara’s 
algorithm is a version of the Euclidean algorithm. Geometrically the elements of the 
continued fraction for b/a correspond to the numbers of equivalent squares in the 
corresponding layers of the decomposition of R(a,b). 


Example 27.53. We illustrate this with the example of a = 9 and b = 25. The as- 
sociated decompositions of R(9,25) and of R(25,9) consist of two large squares, 
one average square, three small squares, and two very small squares. This can be 
read from the regular continued fraction: 2 = [2;1:3: 2]. In Fig. 27.6 we show the 
layers of equivalent squares as bold rectangles. 


Remark 27.54. Notice that after a coordinate rescaling the decomposition remains 
combinatorially the same. Therefore, we can always choose the coordinate scale to 
work with R(1,b) and R(b, 1), where x = {x+y = 0} for some b > 0. 

From the above two remarks it follows that the notion of partitions in the two- 
dimensional case correlates with the notion of regular continued fractions. So we 
conclude with the following natural problem. 
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3 Ey 
) 


Fig. 27.6 A decomposition of bricks R(9,25) and R(25,9). 


Problem 39. (I. Pak [169].) Find a relation between multidimensional continued 
fractions and the geometry of the bijection constructed by O’ Hara’s algorithm. 


27.4 Algorithmic generalized continued fractions 


We have already considered two algorithms that generalize the Euclidean algorithm: 
O’Hara’s algorithm and triangle sequences. There are in fact many other different 
generalizations of the Euclidean algorithm in the spirit of the two mentioned above. 
No geometric interpretation is known for most of them. To give some impression 
of the algorithmic approaches to continued fractions we mention here several other 
algorithms of this nature. We are not planning to go into details of these approaches, 
since it is far away from the scope of this book. The best reference to the algorithmic 
approach to continued fractions is the book [195] by F. Schweiger (see also [28] 
and [210]). First we give a general algorithmic scheme in which such algorithms 
can be realized (for additional details we refer to the preprint [196] of F Schweiger). 
We then show some examples of known algorithms. Finally, we briefly discuss some 
problems related to algorithmic generalizations of continued fractions. 


27.4.1 General algorithmic scheme 


Most of the generalized Euclidean algorithms work according to the following 
scheme. 


Scheme of a generalized Euclidean algorithm. 
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Predefined parameters of the algorithm. Consider the space R” for some integer 
n. The following data specifies each algorithm: 

(i) the domain A C R” on which the algorithm is defined; 

(ii) the operation of subtraction S: R" > R"; 

(iii) the operation of inversion T : R” > R"; 
(iv) the termination domain A C A, at which the algorithm terminates. 
We give examples of 4-tuples (A,S, T,A) below. Notice that it is usually required 
that 

ToS(A) CA. 


Input data. A real vector v € A. 
Aim of the algorithm. To construct the generalized continued fraction. 
Step 0. Put vo = v. 


Recursive step k. We have constructed vy_1. If vg € A, then the algorithm termi- 
nates. If vy ¢ A, the recursive step consists of three substeps: 

Substep k.1: perform an operation of subtraction S; 

Substep k.2: invert the result via the operation of inversion T; 

Substep k.3: write the (kK—1)th element of the continued fraction based on Sub- 
step k.1 and Substep k.2. 
We conclude the step by putting 


Vp = TO S(vg_1). 


Output. If the algorithm stops (say at step m-+ 1), then the last element v,, is consid- 
ered a generalized greatest common divisor. The corresponding continued fraction 
is generated in the iterative steps; it is calculated by parameters of subtractions S 
and inversions T. 


27.4.2 Examples of algorithms 


First we write the classical Euclidean algorithm in the framework of the general 
algorithmic scheme. We will then give several examples of generalizations. 


Example 27.55. Euclidean algorithm. Predefined parameters of the algorithm are 


as follows: 
A= { (x,y) |x>y>Osx,y€ R}; 


S: (xy) ++ (x— LE,y)5 


T : (x,y) + (y,x); 
A = {(x,0) |xeZ,}. 
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In Substep k.3 we remember the element ax_; = ea , where vy_1 = (Pr_-1,9x-1) is 


the vector defined in Step k—1. The resulting continued fraction is 
[a03;@1 :a2:-+°]. 


Example 27.56. Jacobi—Perron algorithm ((92], [178], [177]). Historically, the 
first algorithmic generalization of regular continued fractions is the Jacobi—Perron 
algorithm (see also [191]). We put 


A = {(x0,41,---,%n) | x0 = xj 20,1 <i <n}; 
8: (x0.X1 00-5) > (40— LL tise — BJs --5n— L3N) 5 
T : (X0,%1,-+-,Xn) > (%1,---5%n,X0)5 


A 


A = {(x0,%1,---,Xn) [x1 =O}NA. 


In Substep k.3 we remember the n-dimensional vector 
— (| %0,k-1 | | 24-1 Xnk-1 
ak-1 = | | RSE czas | eas ’ 
Xp k-14 0 |X k-1 X1k-1 
where vy_—1 = (X0,4-1,41,4-1,-+-Xn.k—1) is the vector in R"*+! defined in Step k—1. 


The resulting generalized continued fraction here is the sequence of n-dimensional 
vectors 


(ag ,@1,Q2,...). 


Remark 27.57. If the Jacobi—Perron algorithm terminates, one actually can remove 
the zero coordinate x; = 0 and continue the algorithm in the real space whose di- 
mension is one less (finally we end up at n = 1). 


Example 27.58. Generalized subtractive algorithm ((192]). 
The generalized subtractive algorithm is defined by the following data. First, p is 
fixed for the algorithm. Second, we put 


A = {(x0,41,-+-5%n) | xo > x4 ees aig 2 OF; 

S: (x0,*1, sat Xn) > (x0 Xp X1,++- sXn)3 

T is a permutation of coordinates putting them in nonincreasing order; 
A = {(x0,X1,---,X%n) | Xp =O}NA. 


We remember the permutation that was used in Substep k.2. Indeed, we always 
exchange the first element with some element a,_; € {0,1,...,}. So we can choose 
a sequence of integers 

(ao ,41,42, os i) 


as the generalized p-subtractive continued fraction. 


Remark 27.59. The first subtractive algorithm was introduced in [30] by V. Brun, 
where he considered the case p = |. Later, R. M6nkemeyer in [153] (and further 
E.S. Selmer in [198]) studied the subtractive algorithm for the case p =n. 
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Example 27.60. Fully subtractive algorithm ([193], [194]). In the fully subtractive 
algorithm we also fix p and consider 


A = {(X0,%1,---%n) | x0 = x1 S++ > Xn > O}; 

St (%0,X1,++-,Xn) > (X0 — Xp, X1 —Xpy---)Xp—-1 —XpXpy-+Xn)} 

T is a permutation of coordinates putting them in nonincreasing order; 
A ={(x0,%15«+<)%n) |Xp=OFNA. 


We remember the permutation that was used in Substep k.2, denoting it by ax_1. 
The generalized continued fraction is the sequence of permutations (ao,a1,q2,...). 


27.4.3 Algebraic periodicity 


Let us mention here a famous conjecture known as Jacobi’s last theorem. 


Problem 40. (Jacobi’s last theorem.) Let K be a totally real cubic number field. 
Consider arbitrary elements y and z of K satisfying 0 < y,z < 1 such that 1, y, and 
z are independent over Q. Is it true that the Jacobi—Perron algorithm generates an 
eventually periodic continued fraction with starting data v = (1,y,z)? 


This problem is open not only for the Jacobi—Perron algorithm but also for other 
similar algorithms. The converse statement is true. 


Proposition 27.61. Periodic algorithmic continued fractions correspond to certain 
algebraic irrationalities. 


27.4.4 A few words about convergents 


Let us briefly explain how to generate convergents for a vector v with a generalized 
continued fraction (ao, q) ...). 

This is usually done by inverting the map T oS. First of all, notice that the values 
in the image of the map (To S)~! are enumerated by the possible values of the 
elements of the corresponding continued fractions, so we can write (To S)7! to 
indicate which of the images should be taken. 

One starts with some vector wo. It is natural to choose wo either (1,0,...,0) or 
the generalized greatest common divisor (in case of existence). Then the vectors 


wi =(ToS);) (wo), w2=(ToS)z' (wi), w3 = (ToS)z' (wa), 


are called the convergents of v. 


Remark 27.62. In questions of approximation it is natural to reduce projectively the 
first coordinates: 
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o xX] Xn 
p= (xo,.-+,%n) a p=(=.....). 
xo Xo 
Now arises the question of the quality of approximation of the vector ¥ by the vectors 
w;. For further information on approximation aspects we refer to [195]. 


27.5 Branching continued fractions 


In this subsection we observe some interesting representations of real numbers as 
ratios of certain determinants of infinite matrices related to continued fractions or 
their special generalizations. It is nice to admit that for every algebraic number there 
exists a periodic representation defining the corresponding number. Matrix represen- 
tations do not give a complete invariant of algebraic numbers in the following sense: 
any real number has many different representations, and it is quite hard to determine 
whether two of them define the same number. For further information and references 
we refer to the works of V.Ya. Skorobogatko [204], [205], V.I. Shmojlov [201], 
[202], [203], I. Gelfand and V. Retakh [73], [71]. 

We start with a representation of real numbers by tridiagonal determinants whose 
elements are the elements of the following continued fractions (for further details 
we refer to [94]): 


aiyaj20 0 O 
—lana30 0 
0 —1 a3 a34 0 
0 0 -l 44 445 ..-- 
0 0 0 -l a55 --- 


| = 
ae ao + ae a22 a23 0 0... ; ery) 
433 + E —1 a33 a34 0 
OU eit, O —1 dag gs... 
0 0 -l a55 --- 


Equation (27.1) holds for any continued fraction whose sequence of convergents 
converges to a real number. Here we use dots in matrices to denote the limit of the 
ratios of determinants of the corresponding (n+1) x (n+1) matrix and n x n matrix 
as n tends to infinity. We will also do the same in the above formula. In the case of 
equation (27.1) we have an even stronger statement for n-convergents: 
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a\1 412 0... 0 0 
—laz2 ax... 0 0) 
0 —-lay... 0) 0) 
0 0 0 eee Qn—-1,n—-1 Qn—-1,n 
ee a\2 _ 0 0 0... -! Ann 
me at 423 lang a7... 0 0 
22 An-1,n—-1 1 
Ae eS —lay... 0 0 
ann 


0 0 «++ An—1,n—1 Gn-1,n 
0 0 


—l ann 


Let us give the definition of Hessenberg continued fractions. 


Definition 27.63. Consider a real number x and a sequence of real numbers qj. 
Suppose that 
—a, —a2 —a3z —ay4... 
—1 —a, —az -a3... 
QO —1 —a; —a... 


0 0 —1 —a| ... 
— : 
—d,| —d2 —a@... 
—l1 -—a, —a... 
0 -1l -q,... 


i.e., the limit from the right exists and equals x. Then the expression from the right 
is called the Hessenberg continued fraction for x. 


The most interesting case here occurs when a sequence (a;) has only finitely 
many nonzero entries. 


Proposition 27.64. Let (a;) be a sequence of real numbers with finitely many 
nonzero elements, assuming that the element day is the last nonzero element. Suppose 
that the Hessenberg continued fraction for the sequence (a;) converges to some real 
number x. Then x satisfies the equation 


XM tayx | 4x? +--+ +a__1x+ay = 0. 


In the particular case of cubic equations we have an additional nice formula, 
which could be considered a generalized two-dimensional regular continued frac- 
tion. 


Proposition 27.65. Suppose that a Hessenberg continued fraction for a real number 
x is defined by a sequence (a1, a2,43,0,0,...). Then we have 
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a3 
a2 7} 
a2 — ——— 
a, — 
a, — a= 
a, — 
os a — 
xX=—-a,+ a3 
ag — a — 
a, — 
aj —... 
at a3 
az — ———— 
a, — 
a, — a) — 
a, — 
a, — 


Later A.Z. Nikiporetz gave the following generalization of Hessenberg continued 
fractions. 


Definition 27.66. Consider a real number x, and a two-sided sequence of real num- 
bers (a;). Suppose that 


ak Ak+1 ~Ak+2 —Ak+3 --- ak] ak Ak+1 —~4k+2 --- 
ak-1 ak Qk+1 —Ak+2 «-- Ak—-2 —Ak-1 ak Ak+1 --- 
Ak—2 —Ak-1 ak Ak+1 +++ Ak—3 —Ak—2 —Ak-1 ak 
Ak—3 —Ak—2 —Ak-1 an... Ak—4 —Ak—3 ~Ak—-2 —Ak_-1 --- 

= ‘ “ ‘ oe 
aK —Ak+1 ~Ak+2 --- —axk-1 Ak Ak+1 -- 
—axk-1 Ak Ak+1 +. —Ax-2 —~Ak-| Ak 
—An_2 —Ag_1| —Ak ... —Ap—3 —Ag—_2 —Ag_] .«.. 


—Ag_-3 —Ag—-2 —A-1 ... 


i.e., the limit from the right exists and equals x;,. Then the expression from the right 
is called the Nikiporetz continued fraction for x. 


In analogy to Proposition 27.64 for Hessenberg continued fractions we get the 
following statement for Nikiporetz continued fractions. 


Proposition 27.67. Let (a;) be a two-sided sequence of real numbers with finitely 
many nonzero elements. We assume that all elements with negative indices are zeros, 
dg = I, and the element ay is the last nonzero element. Suppose that the Nikiporetz 
continued fraction for the sequence (a;) converges to some real number xx. Then x, 
satisfies the equation 


Beta! + agtt* $-6 +n1XE +n =0. 


Notice that in the particular case k = | we have the statement of Proposi- 
tion 27.64. 
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We continue with Skorobogatko continued fractions. 


Definition 27.68. The expression 


a\ a2 
a=bo+ 


i 
QB a4 ; as a6 
aes ol Gee aas Pha Heda 


bits, 


is called the Skorobogatko continued fraction for . 


Proposition 27.69. The Skorobogatko continued fraction for Q@ as in the previous 
definition is written in determinant form as follows: 


bo aj a2 0000... 
-l1b 0 aja, 0 0... 
-1 0 bb 0 O asa... 


0-10 0b,0 0 
0 0-10 0b5 0 
0 0 -10 0 O be. 


Finally, we would like to mention another generalization of continued fractions 
for the case of noncommuting elements and its expression in terms of quasidetermi- 
nants studied by I. Gelfand and V. Retakh in [73], [71] (see also in [170]). Here we 
formulate only a consequence for the case of commutative elements (for the non- 
commutative case we refer the interested reader to the original papers mentioned 
above). 


Definition 27.70. Consider a real number x and a collection of real numbers a; ;. 
Suppose that 

G11 412 413 414... 

—1 dy2 a93 24... 

0 —1 a33 a34... 

0 O —lag... 


a22 423 424... 
—1 a33 434... 
0 —-lag... 
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i.e., the limit from the right exists and equals x. Then the expression from the right 
is called the quasideterminant for x. 


We mention once more that the notion of quasideterminant in the case of non- 
commutative elements a;,; is more complicated; see [72] and [73]. 


Proposition 27.71. In the setting of the above definition, we have 


1 


xX=ayt+ y.. A jy 
Al a ae 
ALJ 


1 
2 a3;, F... 
i 3j2 


1 


27.6 Continued fractions and rational knots and links 


In this section we explain how continued fractions are used in topology for the 
classification of rational knots and links. For additional information and references 
we refer to [114] and [115]. 


27.6.1 Necessary definitions 


Recall that a knot is a smooth embedding of a circle R/Z into R*; a link is a smooth 
embedding of several circles into R?. 


Definition 27.72. We say that a knot K has an n-bridge representation if K is isotopic 
to some knot having only n maxima and n minima as critical points of the natural 
height function on K given by the z-coordinate in R?. 

The bridge number of a knot K is the minimal number n such that K has an 
n-bridge representation. 

We say that a knot of bridge number n is an n-bridge knot. 

A 2-bridge knot is said to be a rational knot. 


For the study of n-bridge knots it is natural to consider n-tangles. 


Definition 27.73. An n-tangle is a proper embedding of the disjoint union of n arcs 
into a three-dimensional ball with 2n marked points such that the endpoints of the 
arcs map to distinct marked boundary points. 


An n-tangle is called trivial if its endpoints are connected by straight lines. 
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27.6.2 Rational tangles and operations on them 


Definition 27.74. We say that a 2-tangle is a rational tangle if there exists a smooth 
deformation of the three-dimensional ball under which the 2-tangle evolves to the 
trivial 2-tangle. 


Graphically, tangles are represented as special regular mappings of the arcs to 
the plane (i.e., having only finitely many singular points, and such that all these 
singularities are double crossings). Two pairs of endpoints are mapped to the lines 
y =O and y = | respectively, and the rest are mapped to the band bounded by these 
lines. At each crossing we indicate which of the two branches of the double crossing 
is above and which is below. (See an example of a tangle in Fig. 27.7.) 


Basic tangles. First we define the following three tangles: 


a ee ree a 
roy=! rei X! re) | |! 
by amen Ti i Ae i ee 


Let us now define the addition, multiplication by —1, and inversion operations 
on 2-tangles. 
The sum operation. The sum of two tangles 7; and 7> is the tangle T; + 7> defined 
as follows: 


Multiplication by —1. If we change all the crossings in the tangle T, then we get 
the tangle called opposite to T and denoted by —T. 


The inversion operation. Let T be an arbitrary 2-tangle. The inverse tangle T' is 
defined as follows: 


Definition 27.75. Let m be a positive integer. We define 
T(m) =T(1)+---+T(1). 
S"__—__ 
m 


Letting [aq;a) : a2 : -++ : dy] be a continued fraction with integer coefficients, then 
set 


T ([aosa1 Ldgieet an) =T(ao)+ (T (a1) + (T (az) + (0+ TH(aq)) VY. 
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Fig. 27.7 A rational tangle T([4;2 : 3]). 


As we will see later, in Theorem 27.76, the isotopy class of a tangle T(”) does not 
depend on the choice of the integer elements of the continued fractions representing 
the fraction a 


27.6.3 Main results on rational knots and tangles 


The next theorem shows that the set of rational tangles is in a natural one-to-one 
correspondence with the rational numbers. This theorem is a reformulation of the 
Conway theorem from [42]. 


Theorem 27.76. (i) Two tangles T (ao; : az: +++: An|) and T ([bo3b1 2. bz: +++: bm]) 
for continued fractions with integer coefficients are isotopic if and only if 


[a0341 22 :+++ 2 Ay] = [bos by 2 ba: +++: Dy]. 


(ii) For every rational tangle T there exists a rational number : such that T is 


isotopic to T(f). 


To get a knot from a tangle one should use the following closing operation. Each 
tangle diagram has two endpoints on the line y = 0. Connect them by some curve that 
does not intersect the diagram. Do the same for the two endpoints on the line y = 1. 
See an example of a tangle T (3) and the corresponding trefoil knot in Fig. 27.8. 


Theorem 27.77. (H. Schubert [190]) Let re and Ps be two rational numbers sat- 
isfying gcd(p1,q1) = gcd(p2,q2) = 1. Then the corresponding knots or links K(f) 
and K(2) are isotopic if and only if the following two conditions hold: 


(a) pi = pr 
(b) either q, = q2(mod p,) or qiqz = 1(mod p;). 
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Fig. 27.8 A rational tangle and its closure. 


Remark 27.78. The classification of rational knots arises in the topology of three- 
dimensional manifolds. Namely, the 2-fold branch coverings spaces of S* along the 
rational knots and links represent lens spaces; see the book [197] for more details. 


So the problem of classification of rational knots is completely solved. The next 
step in this direction would be the classification of 3-bridge knots. 


Problem 41. Describe all 3-bridge knots and links. 


It is natural to suppose that one should use an appropriate generalization of con- 
tinued fractions to investigate this question. So we have an additional question on a 
natural extension of continued fractions. 


Problem 42. Find a generalization of continued fractions that classifies rational 3- 
tangles. 


Rational 3-tangles can be defined similarly to rational 2-tangles. Nevertheless, 
we leave some freedom in the definition in order to try to achieve the goals of Prob- 
lem 41. 
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